"LLMs can't reason, look at how their accuracy drops if you change the numbers in the problem!!!"
The accuracy drop (%):
The accuracy drop (%):
Comments