sashaboguraev.bsky.social
PhD student @UT_Linguistics | prev. CS, Math, Comp. Cognitive Sci @cornell
6 posts
106 followers
235 following
Getting Started
comment in response to
post
Do you have any thoughts on whether these a) emerged naturally during the RL phase of training (rather than being specifically engineered to encourage more generation or an artifact of some other post-training phase) and if so b) actually represent backtracking in the search?
comment in response to
post
I'm curious as to what you think of the explicit backtracking in the reasoning model's chains of thoughts? I agree that much of the CoT feels odd and unfaithful, but also there's something that feels very easily anthromorphizable in the various “oh wait”s, and “now I see”s.
comment in response to
post
Been spending sometime over break making my way through the Bayesian Models of Cognition book — great read.