nsaphra.bsky.social

Waiting on a robot body. All opinions are universal and held by both employers and family. Recruiting students to start my lab! ML/NLP/they/she.

1,992 posts 9,304 followers 1,386 following

Why can’t LMs solve puzzles about the number systems of languages, when they can solve really complex math problems? Our new paper, led by @antararb.bsky.social, looks at why this intersection of language and math is difficult, and what this means for LM reasoning! arxiv.org/abs/2506.13886

submitted 2 hours ago • 1 comment

My partner & I were trying to ID a bird we heard while walking and the guy who had stopped his car to let us pass got out and was like "it's a frog, actually. Cope's Gray. I do frog surveys... Sorry" And then got in his car without a word and drove off. Absolute king

submitted 20 hours ago • 338 comments

New research from MIT found that those who used ChatGPT can’t remember any of the content of their essays. Key takeaway: the product doesn’t suffer, but the process does. And when it comes to essays, the process is how they learn. arxiv.org/pdf/2506.088...

submitted 13 hours ago • 17 comments

An easy way to tell low-information AI skepticism apart from informed skepticism: extremely confident beliefs about cognition, reasoning, and learning in real brains. (Cog/neuro)scientists don’t know how intelligence develops, but you’re convinced prediction objectives have no value?

submitted 17 hours ago • 1 comment

What is common knowledge in your field but shocks outsiders? Digital resources, particularly eBooks and audiobooks, are going to bankrupt libraries if something isn't done to halt the extortionary pricing models of publishers.

submitted 1 day ago • 7 comments

When I was at Sheryl Sandberg’s FB lady intern BBQ I met “Cynthia” and we talked about her cool environmental engineering work in the data centers. When Cynthia scheduled an intern goodbye concert, I discovered that was the real name of Vienna Teng, who I’d been listening to since high school.

submitted 1 day ago • 7 comments

the load-bearing institution we broke first was school. this is very weird and we won't know how much it matters for years

submitted 2 days ago • 11 comments

to anyone who needs further proof that Zohran is the abundance candidate over Cuomo: Zohran wrote a bill giving the MTA streamlined permitting authority, which would speed up and reduce the costs of subway construction

submitted 2 days ago • 12 comments

Excited to share this project specifying a research direction I think will be particularly fruitful for theory-driven cognitive science that aims to explain natural behavior! We're calling this direction "Naturalistic Computational Cognitive Science"

submitted 2 days ago • 3 comments

The Power Broker. Man spent my whole childhood complaining about the fall of New York every time we drove on the Cross Bronx Expressway, West Side Parkway, the Henry Hudson Parkway, the Triborough Bridge,

submitted 2 days ago • 0 comments

If a human therapist routinely failed to pick up on clear signs of suicidality, or was regularly unable to separate patient delusions from reality, they wouldn't be allowed to practice. These outcomes are dangerous. futurism.com/stanford-the...

submitted 4 days ago • 5 comments

"Over four months, LLM users consistently underperformed at neural, linguistic, and behavioral levels." arxiv.org/abs/2506.08872

submitted 3 days ago • 28 comments

Instead of like buttons we should have pike buttons and maybe also other types of fish buttons

submitted 5 days ago • 9 comments

If you're at #RLDM2025, check out our contributed talk at Session 3 (Fri 6/13, 12:10pm), presented by my brilliant co-first-author on this project @sjohnsonyu.bsky.social! Wasn't able to make it in person, but would love to hear your thoughts @kempnerinstitute.bsky.social @harvardmed.bsky.social

submitted 6 days ago • 0 comments

ACL paper alert! What structure is lost when using linearizing interp methods like Shapley? We show the nonlinear interactions between features reflect structures described by the sciences of syntax, semantics, and phonology.

submitted 6 days ago • 3 comments

Reasoning is about variable binding. It’s not about information retrieval. If a model cannot do variable binding, it is not good at grounded reasoning, and there’s evidence accruing that large scale can make LLMs worse at in-context grounded reasoning. 🧵

submitted 6 days ago • 4 comments