Reminder that LLMs are a dismal, expensive prototype with a handful of potential applications, not a reliable general-purpose technology.
Researchers tried a couple on the 2025 USA Mathematical Olympiad, a high school competition.
Best score was 2 out of 42.
https://arxiv.org/abs/2503.21934v1
Comments
2) general research, as good as a gofer when starting with a blank page
3) very specifically in my line, allowing me to go from concept to code, particularly in non-specialist languages.
Nobody is saying it is replacing skilful people in total.
Things like "get the text in this image". Cool! But not an end in itself.