
stellaathena.bsky.social • 83 days ago

Is anyone well-read in the DS-R1 tea leaves and confident they know which distillation method was used? It's not clear to me whether they mean "train on data from another model" or something that I'd consider "actually distilling".

My current guess is synthetic CoT?
