ThreadSky
About ThreadSky
Log In
quinta.mastodon.uno.ap.brid.gy
•
5 days ago
Can We Trust AI Benchmarks? An Interdisciplinary Review of Current Issues in AI Evaluation
https://blog.quintarelli.it/2025/02/can-we-trust-ai-benchmarks-an-interdisciplinary-review-of-current-issues-in-ai-evaluation/
Comments
Log in
with your Bluesky account to leave a comment
No comments yet
Posting Rules
Be respectful to others
No spam or self-promotion
Stay on topic
Follow Bluesky's terms of service
×
Reply
Post Reply
Comments