OpenAI's "most powerful system" makes shit up more than half the time. For its "o4-mini" model, the rate is ***79 percent***.
Reposted from The New York Times
The newest and most powerful A.I. technologies — so-called reasoning systems from companies like OpenAI, Google and the Chinese start-up DeepSeek — are generating more errors, not fewer. As their math skills have notably improved, their handle on facts has gotten shakier.
