sashaboguraev.bsky.social
PhD student @UT_Linguistics | prev. CS, Math, Comp. Cognitive Sci @cornell
6 posts 106 followers 235 following
comment in response to post
Do you have any thoughts on whether these a) emerged naturally during the RL phase of training (rather than being specifically engineered to encourage more generation or an artifact of some other post-training phase) and if so b) actually represent backtracking in the search?
comment in response to post
I'm curious as to what you think of the explicit backtracking in the reasoning model's chains of thought? I agree that much of the CoT feels odd and unfaithful, but there's also something that feels very easily anthropomorphizable in the various “oh wait”s and “now I see”s.
comment in response to post
Been spending some time over break making my way through the Bayesian Models of Cognition book. Great read.