1. DeepSeek left a LOT out of their cost estimates. The $5.6M they spent on their final training run seems to be only the price tag for the time the 2,048 Nvidia H800 GPUs were running over two months. 🧵2/7

Comments