New preprint! Randomly Sampled Language Reasoning Problems Reveal Limits of LLMs.

In this paper with @kesnet50.bsky.social and my advisor Armando Solar-Lezama, we investigate how LLMs perform on randomly selected simple language reasoning problems.

https://arxiv.org/abs/2501.02825

Comments