BIG release by DeepSeek AI🔥🔥🔥 DeepSeek-R1 & DeepSeek-R1-Zero: two 660B reasoning models are here, alongside 6 distilled dense models (based on Llama & Qwen) for the community! huggingface.co/deepseek-ai huggingface.co/deepseek-ai/... - ThreadSky

adinayakup.bsky.social • 39 days ago

BIG release by DeepSeek AI🔥🔥🔥

DeepSeek-R1 & DeepSeek-R1-Zero: two 660B reasoning models are here, alongside 6 distilled dense models (based on Llama & Qwen) for the community!
https://huggingface.co/deepseek-ai
https://huggingface.co/deepseek-ai/DeepSeek-R1

Comments

Posting Rules

Comments

Posting Rules

Reply