I’m always going to be skeptical of any such claims from OpenAI itself, because they have a strong financial incentive to declare AGI as soon as possible (since this frees them from the Microsoft deal).
The ARC scores look promising, but they may also be an extreme case of benchmark maxxing.