"When a measure becomes a target, it ceases to be a good measure"
(Goodhart's law)

The story goes as follows: someone made a test to check how capable LLMs were, and the models of the time did poorly on it.
But then new LLMs were created and trained specifically to pass the test, and lo and behold, they actually […]
