Profile avatar
thomwolf.bsky.social
Co-founder @huggingface
89 posts 10,867 followers 1,415 following
Prolific Poster
Conversation Starter

Hugging Face just entered the top 10 organizations on @github.com Close to 500,000 GitHub stars across our open-source libraries! Couldn't be more proud of what this 220-person team is accomplishing

Building real-time WebRTC and Websocket applications is very difficult to get right in Python. Until now - Introducing FastRTC, the realtime communication library for Python ⚡️ huggingface.co/blog/fastrtc

After 6+ months in the making and over a year of GPU compute, we're excited to release the "Ultra-Scale Playbook": hf.co/spaces/nanot... A book to learn all about 5D parallelism, ZeRO, CUDA kernels, how/why overlap compute & coms with theory, motivation, interactive plots and 4000+ experiments!

LLM Reasoning labs will be eating good today🍔 We commandeered the HF cluster for a few days and generated 1.2M reasoning-filled solutions to 500k NuminaMath problems with DeepSeek-R1 🐳 Have fun!

OK! My Google colleague Thang Luong shared some exciting updates about AlphaGeometry2! AG2 now has surpassed the average gold-medalist in solving Olympiad geometry problems, w/ a solve rate of 84% compared to 54% previously! Paper: arxiv.org/abs/2502.03544 See full list of authors on link

From an open-research point of view, maybe the greatest thing about DeepSeek–R1 is how its RL training technique appears so straightforward/simple in comparison to the cumbersome approaches we were starting to think necessary for reasoning like Process Reward Models or Monte Carlo Tree Search. [1/2]

We've just released the new Spaces search and it's quite mind blowing Explore over 400k AI Apps in the most intuitive way background removal, image-to-3D, comic factory, sound transcription, image editing, clothes virtual try-on, etc All made by AI builders for AI builders huggingface.co/spaces

« appending "Wait" multiple times to the model's generation » is our current most likely path to AGI :) See the fresh arxiv.org/abs/2501.19393 by Niklas Muennighoff et al.

Yes! a 15-min-read post to catch up on all experiments, reproductions, explorations around DeepSeek R1 We've summarized everything we did & saw in this first week since DS came to light Results/code/dataset/experiment => if we missed one, share & we'll update Link: huggingface.co/blog/open-r1...

I wrote some reflections on DeepSeek, open-source, AI, US and China, starting from Dario's recent essay calling for stronger export controls. I mostly disagree with his essay and think it missed the point You can read it here: thomwolf.io/blog/deepsee...

I was briefly chatting on Bloomberg earlier today about DeepSeek and open-source AI youtu.be/wjU2zTbrqQY?...

The most impactful open-source project of today (dixit Vercel VP of AI) => huggingface.co/blog/open-r1

Every time you declare open-source AI dead another team rise up like a Whac-A-Mole. Intelligence will be commoditized! The remaining moat will be the quality of the product and how it integrate with co-factors. That’s were billions dollars AI companies will be built.

The AI agent course keeps breaking records! Now breaking discord registration system You can still join at bit.ly/hf-learn-age... (let’s test discord infrastructure hahah)

Wait! Over 35 *thousands* people already registered for the AI Agent course in a few days... time to find the biggest conference center in the world and gather everyone for a giga-conference on agents? you can still join the online course here bit.ly/hf-learn-age...

🚨 I'll be in Davos next week as part of the Unicorn program at the World Economic Forum Let's meet if you're interested in leveraging/building/discussing AI!

Mind blown 👇 when people ask whether you need an agent framework at all! All evals should move to agentic evals in 2025 in my opinion. We’re just leaving so much capabilities of our models on the table. Benchmarked with smolagents: github.com/huggingface/...

Free course on Agents by Hugging Face. We just added a chapter to smol course on agents. Naturally, using smolagents! The course cover these topics: - Code agents - Retrieval agents - Custom functional If you're building agent applications, this course should help.

Yes!

What was the most impactful/visible/useful release on evaluation in AI in 2024?

New LLM evals entering and leaving the field saturated before the paper is even published in ML conferences 📸 matt_mccreary1

4 out of the 6 top trending repos on GitHub are from Chinese AI teams today 2025 is gonna look quite different from 2024

Our first release of 2025: 𝙨𝙢𝙤𝙡𝙖𝙜𝙚𝙣𝙩𝙨, 𝘁𝗵𝗲 𝘀𝗶𝗺𝗽𝗹𝗲𝘀𝘁 𝗹𝗶𝗯𝗿𝗮𝗿𝘆 𝘁𝗼 𝗯𝘂𝗶𝗹𝗱 𝗮𝗴𝗲𝗻𝘁𝗶𝗰 𝘀𝘆𝘀𝘁𝗲𝗺𝘀! 💥 Main logic in ~1000 LoC 🧑‍💻 Agent writes its actions in code! LLMs are much better at writing code than current standard of writing JSON => higher perf 🌍 Any LLM support (h/t LiteLLM) 🛡️ Secure code exec (h/t E2B)

With the new OpenAI O3 moving performance from 5% up to 25% on FrontierMath it’s time to push open-source models upwards! We're super happy to release FineMath, the best open math dataset yet. A strong baseline to start training your own models Find it in the trending section of HuggingFace ;)

what was this thing btw? "Moreover, ARC-AGI-1 is now saturating – besides o3's new score, the fact is that a large ensemble of low-compute Kaggle solutions can now score 81% on the private eval" big ensemble of heuristics?

challenge!

Want to grasp test time compute, the secret sauce behind recent AI breakthroughs? We've open-sourced the method - perfect holiday reading to understand what's powering the next wave of AI model development. 🧠✨ huggingface.co/spaces/Huggi...

Introducing 📐FineMath: the best open math pre-training dataset with 50B+ tokens! Math remains challenging for LLMs and by training on FineMath we see considerable gains over other math datasets, especially on GSM8K and MATH. 🤗 huggingface.co/datasets/Hug... Here’s a breakdown 🧵

2025 will be the year of AI for science Leveraging all the things we've recently learned training AI models for 1000x impact in science and this will need data! More details: huggingface.co/blog/lemater... Thread: ...

Big News in AI4Science! ✨ We are thrilled to launch LeMaterial, an open-source project in collaboration with @hf.co to accelerate materials discovery ⚛️🤗 Discover LeMat-Bulk: a 6.7M-entry dataset standardizing and unifying Materials Project, Alexandria and OQMD

The Open LLM Leaderboard got a new front page for Christmas Check it out at huggingface.co/spaces/open-...

When you’ve finished your day of emails

Spanish, Filipino, Amharic, French, German, Basque, Catalan, Galician, Guarani, Telugu, Italian, Pashto, Romanian, Tamil, Urdu, Danish... and many more! All included in the FineWeb2 Community Annotation Sprint! 🔥 💫 Join to build an impactful dataset for your language!

The FineWeb team is happy to finally release "FineWeb2" 🥂🥳 FineWeb 2 extends the data driven approach to pre-training dataset design that was introduced in FineWeb 1 to now covers 1893 languages/scripts Details: huggingface.co/datasets/Hug... A detailed open-science tech report is coming soon

Announcing 🥂 FineWeb2: A sparkling update with 1000s of 🗣️languages. We applied the same data-driven approach that led to SOTA English performance in🍷 FineWeb to thousands of languages. 🥂 FineWeb2 has 8TB of compressed text data and outperforms other datasets.

Four new visualisations of the rise of open-source AI models in 2024 added in our daily nuggets! - explore how various AI tasks have been growing - how community likes connect AI models together - the geography of models creators and followers Explore them here: huggingface.co/spaces/huggi...

The FineWeb team is happy to finally release "FineWeb2" 🥂🥳 FineWeb 2 extends the data driven approach to pre-training dataset design that was introduced in FineWeb 1 to now covers 1893 languages/scripts Details: huggingface.co/datasets/Hug... A detailed open-science tech report is coming soon

Announcing Global-MMLU - an improved MMLU Open dataset with evaluation coverage across 42 languages. The result of months of work with the goal of advancing Multilingual LLM evaluation. Built together with the community and amazing collaborators at Cohere4AI, MILA, MIT, and many more.