sashaboguraev.bsky.social - Profile | ThreadSky | a Reddit-style client for Bluesky

sashaboguraev.bsky.social

PhD student @UT_Linguistics | prev. CS, Math, Comp. Cognitive Sci @cornell

6 posts 106 followers 235 following

comment in response to post

Do you have any thoughts on whether these a) emerged naturally during the RL phase of training (rather than being specifically engineered to encourage more generation, or being an artifact of some other post-training phase) and, if so, b) actually represent backtracking in the search?

submitted 20 days ago

comment in response to post

I'm curious as to what you think of the explicit backtracking in the reasoning model's chains of thought? I agree that much of the CoT feels odd and unfaithful, but there's also something that feels very easily anthropomorphizable in the various “oh wait”s and “now I see”s.

submitted 20 days ago

comment in response to post

Been spending some time over break making my way through the Bayesian Models of Cognition book. Great read.

submitted 65 days ago