DeepSeek, a LLM trained for a fraction of the cost of GPT-Xx models, in 2 months for 6 million, on limited GPUs due to export restrictions, and competing head to head. This is crazy.

It's not the AI part I'm excited about, it's the level of efficiency. https://github.com/deepseek-ai/DeepSeek-V3

Comments