So, is the TL;DR of the DeepSeek paper basically that they taught a model to use critical "thinking" skills rather than relying on brute-force training? Or am I missing something, and AI engineers basically realised that AI learning could excel if it used transferable skills, the way human learners do?