Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Sure, I'm just answering your question of what people are benchmarking and it's not elixir. You could be the person that benchmarks LLMs in niche languages and shows how bad they are at it.

If your benchmark suite became popular enough and folks referenced it, the people training the LLMs would most likely try to make the model better at those languages.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: