Been looking forward to this! I particularly like her juxtaposition of o3's performance with the strategies used by previous ARC-AGI winners, which left her "a bit disappointed: each of these methods went against the assumptions that I described above that, for me at least, made ARC so attractive".
Reposted from
Melanie Mitchell
Some of my thoughts on OpenAI's o3 and the ARC-AGI benchmark
aiguide.substack.com/p/did-openai...