I think a lot of these math benchmarks are absurd. It seems people are chasing the score without thinking about what we're actually trying to do.
Failing to get 100% on 3-digit × 3-digit multiplication while getting 6-digit × 4-digit right should make us question everything.
Something fundamental is wrong.
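To make that concrete, here is a rough sketch of how one could check for this kind of inversion. Everything in it is an assumption for illustration: `multiply_fn` is a hypothetical stand-in for whatever model is being tested, and `accuracy_grid` and `inversions` are names I made up, not part of any benchmark. It scores exact-match accuracy per (digits, digits) cell and then lists cells where an easier size does worse than a harder one.

```python
import random
from typing import Callable, Dict, List, Tuple

Cell = Tuple[int, int]  # (digits in first operand, digits in second operand)


def random_n_digit(n: int, rng: random.Random) -> int:
    """Return a random integer with exactly n digits (no leading zero)."""
    return rng.randint(10 ** (n - 1), 10 ** n - 1)


def accuracy_grid(
    multiply_fn: Callable[[int, int], int],  # hypothetical model under test
    max_digits: int = 6,
    trials: int = 50,
    seed: int = 0,
) -> Dict[Cell, float]:
    """Exact-match accuracy for every (a_digits, b_digits) cell."""
    rng = random.Random(seed)
    grid: Dict[Cell, float] = {}
    for a_digits in range(1, max_digits + 1):
        for b_digits in range(1, max_digits + 1):
            correct = 0
            for _ in range(trials):
                a = random_n_digit(a_digits, rng)
                b = random_n_digit(b_digits, rng)
                # Python integers are exact, so a * b is the ground truth.
                if multiply_fn(a, b) == a * b:
                    correct += 1
            grid[(a_digits, b_digits)] = correct / trials
    return grid


def inversions(grid: Dict[Cell, float]) -> List[Tuple[Cell, Cell]]:
    """Pairs (easy, hard) where the easier cell scores strictly lower.

    'Easier' here just means no more digits in either operand, which is
    an assumption about difficulty, not a fact about any model.
    """
    found = []
    for easy, easy_acc in grid.items():
        for hard, hard_acc in grid.items():
            if easy == hard:
                continue
            if easy[0] <= hard[0] and easy[1] <= hard[1] and easy_acc < hard_acc:
                found.append((easy, hard))
    return found
```

If the rule were actually learned, `inversions(accuracy_grid(multiply_fn))` should come back empty, or close to it. A long list of inversions is the "something fundamental is wrong" signal, regardless of what the headline score says.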
Comments
Note: the original image is a GIF. All scores start out bad but improve, from the top left to the bottom right.
Rules still aren't learned
Benchmarks are far from enough, and the better we get, the harder they are to evaluate. Benchmarks mean nothing if you don't understand their purpose or limitations.