I love research using first principles because one of my students was working on this applied research project on limited data and compute and ended up inventing a cool new fine tune scaling lemma as a side quest! Very Deepseek energy 🙂‍↕️

Comments