Structured outputs can decrease LLM's performance in some tasks

I replicated @willkurt.bsky.social / @dottxtai.bsky.social rebuttal of Let Me Speak Freely? (LMSF) using gpt-4o-mini

The rebuttal correctly highlights many flaws with the original study, but ironically, LMSF's conclusion still holds
Post image

Comments