Can We Trust AI Benchmarks? An Interdisciplinary Review of Current Issues in AI Evaluation https://blog.quintarelli.it/2025/02/can-we-trust-ai-benchmarks-an-interdisciplinary-review-of-current-issues-in-ai-evaluation/
