A few people have responded to this paper by saying "well people freaked out about calculators too".
It's a funny comparison because people *are* using chatbots as calculators - and they suck. This test measured how accurately an LLM performs simple multiplication:
https://xcancel.com/yuntiandeng/status/1889704768135905332/photo/1
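For anyone wondering how a test like this could be scored, here is a rough sketch. It assumes accuracy means an exact match on the product, grouped by how many digits each operand has; I don't know that the linked test was run this way. `ask_model` is a hypothetical stand-in for whatever chatbot you would query, and the two toy "models" exist only to make the script runnable.

```python
import random
from typing import Callable, Dict, Tuple


def random_operand(digits: int, rng: random.Random) -> int:
    """Return a random integer with exactly `digits` decimal digits."""
    low = 10 ** (digits - 1)
    high = 10 ** digits - 1
    return rng.randint(low, high)


def multiplication_accuracy(
    ask_model: Callable[[int, int], str],  # hypothetical: returns the model's answer as text
    max_digits: int = 3,
    trials: int = 20,
    seed: int = 0,
) -> Dict[Tuple[int, int], float]:
    """Exact-match accuracy for every (digits of a, digits of b) pair."""
    rng = random.Random(seed)
    grid: Dict[Tuple[int, int], float] = {}
    for a_digits in range(1, max_digits + 1):
        for b_digits in range(1, max_digits + 1):
            correct = 0
            for _ in range(trials):
                a = random_operand(a_digits, rng)
                b = random_operand(b_digits, rng)
                # Tolerate whitespace and thousands separators in the reply.
                reply = ask_model(a, b).strip().replace(",", "")
                if reply == str(a * b):
                    correct += 1
            grid[(a_digits, b_digits)] = correct / trials
    return grid


def exact_model(a: int, b: int) -> str:
    """Toy stand-in that always answers correctly."""
    return str(a * b)


def sloppy_model(a: int, b: int) -> str:
    """Toy stand-in that is off by one once the product reaches five digits."""
    product = a * b
    return str(product) if product < 10_000 else str(product + 1)


if __name__ == "__main__":
    for cell, accuracy in sorted(multiplication_accuracy(sloppy_model).items()):
        print(cell, accuracy)
```

Each cell of the resulting grid is a fraction of correct answers, so a table built this way shows accuracy per operand size rather than how far off the wrong answers were.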
Comments
https://arxiv.org/html/2405.14838v1
is one of the papers the researcher was working on in relation to this concept, for example.
But researchers also want to analyze different methods of solving this problem, so that the model can interact with different systems, and so that we can understand it and perhaps ourselves better too.
I can't even tell what the chart is actually measuring but I presume it's more of a histogram than a measure of proportional error.
https://ercim-news.ercim.eu/en136/special/chatbots-socrates-dialogues-in-learning
https://link.springer.com/chapter/10.1007/978-3-031-75599-6_1#Abs1