Researchers: this is _not_ how you evaluate LLMs

https://www.nature.com/articles/s41467-024-55628-6

Comments