thomwolf.bsky.social - Profile | ThreadSky | a Reddit-style client for Bluesky

Hugging Face just entered the top 10 organizations on @github.com! Close to 500,000 GitHub stars across our open-source libraries! Couldn't be more proud of what this 220-person team is accomplishing.

submitted 1 day ago • 0 comments

Building real-time WebRTC and Websocket applications is very difficult to get right in Python. Until now - Introducing FastRTC, the realtime communication library for Python ⚡️ huggingface.co/blog/fastrtc

submitted 1 day ago • 1 comment

After 6+ months in the making and over a year of GPU compute, we're excited to release the "Ultra-Scale Playbook": hf.co/spaces/nanot... A book to learn all about 5D parallelism, ZeRO, CUDA kernels, and how/why to overlap compute & comms, with theory, motivation, interactive plots and 4000+ experiments!

submitted 7 days ago • 3 comments

LLM Reasoning labs will be eating good today🍔 We commandeered the HF cluster for a few days and generated 1.2M reasoning-filled solutions to 500k NuminaMath problems with DeepSeek-R1 🐳 Have fun!

submitted 14 days ago • 2 comments

OK! My Google colleague Thang Luong shared some exciting updates about AlphaGeometry2! AG2 now has surpassed the average gold-medalist in solving Olympiad geometry problems, w/ a solve rate of 84% compared to 54% previously! Paper: arxiv.org/abs/2502.03544 See full list of authors on link

submitted 18 days ago • 1 comment

From an open-research point of view, maybe the greatest thing about DeepSeek–R1 is how its RL training technique appears so straightforward/simple in comparison to the cumbersome approaches we were starting to think necessary for reasoning like Process Reward Models or Monte Carlo Tree Search. [1/2]

submitted 20 days ago • 1 comment

We've just released the new Spaces search and it's quite mind-blowing. Explore over 400k AI Apps in the most intuitive way: background removal, image-to-3D, comic factory, sound transcription, image editing, clothes virtual try-on, etc. All made by AI builders for AI builders: huggingface.co/spaces

submitted 21 days ago • 1 comment

« appending "Wait" multiple times to the model's generation » is our current most likely path to AGI :) See the fresh arxiv.org/abs/2501.19393 by Niklas Muennighoff et al.

submitted 23 days ago • 0 comments

Yes! A 15-min-read post to catch up on all experiments, reproductions, explorations around DeepSeek R1. We've summarized everything we did & saw in this first week since DS came to light. Results/code/dataset/experiment => if we missed one, share & we'll update. Link: huggingface.co/blog/open-r1...

submitted 25 days ago • 3 comments

I wrote some reflections on DeepSeek, open-source, AI, US and China, starting from Dario's recent essay calling for stronger export controls. I mostly disagree with his essay and think it missed the point. You can read it here: thomwolf.io/blog/deepsee...

submitted 25 days ago • 2 comments

I was briefly chatting on Bloomberg earlier today about DeepSeek and open-source AI youtu.be/wjU2zTbrqQY?...

submitted 29 days ago • 1 comment

The most impactful open-source project of today (dixit Vercel VP of AI) => huggingface.co/blog/open-r1

submitted 30 days ago • 0 comments

Every time you declare open-source AI dead, another team rises up like a Whac-A-Mole. Intelligence will be commoditized! The remaining moat will be the quality of the product and how it integrates with co-factors. That’s where billion-dollar AI companies will be built.

submitted 31 days ago • 2 comments

The AI agent course keeps breaking records! Now it's breaking Discord's registration system. You can still join at bit.ly/hf-learn-age... (let’s test Discord's infrastructure hahah)

submitted 36 days ago • 0 comments

Wait! Over 35 thousand people already registered for the AI Agent course in a few days... Time to find the biggest conference center in the world and gather everyone for a giga-conference on agents? You can still join the online course here: bit.ly/hf-learn-age...

submitted 38 days ago • 1 comment

🚨 I'll be in Davos next week as part of the Unicorn program at the World Economic Forum. Let's meet if you're interested in leveraging/building/discussing AI!

submitted 42 days ago • 0 comments

Mind blown 👇 when people ask whether you need an agent framework at all! All evals should move to agentic evals in 2025, in my opinion. We’re just leaving so much of our models' capabilities on the table. Benchmarked with smolagents: github.com/huggingface/...

submitted 43 days ago • 2 comments

Free course on Agents by Hugging Face. We just added a chapter to smol course on agents. Naturally, using smolagents! The course covers these topics: - Code agents - Retrieval agents - Custom functional If you're building agent applications, this course should help.

submitted 45 days ago • 1 comment

Yes!

submitted 49 days ago • 7 comments

What was the most impactful/visible/useful release on evaluation in AI in 2024?

submitted 52 days ago • 3 comments

New LLM evals entering and leaving the field saturated before the paper is even published in ML conferences 📸 matt_mccreary1

submitted 52 days ago • 2 comments

4 out of the 6 top trending repos on GitHub are from Chinese AI teams today. 2025 is gonna look quite different from 2024.

submitted 54 days ago • 2 comments

Our first release of 2025: 𝙨𝙢𝙤𝙡𝙖𝙜𝙚𝙣𝙩𝙨, 𝘁𝗵𝗲 𝘀𝗶𝗺𝗽𝗹𝗲𝘀𝘁 𝗹𝗶𝗯𝗿𝗮𝗿𝘆 𝘁𝗼 𝗯𝘂𝗶𝗹𝗱 𝗮𝗴𝗲𝗻𝘁𝗶𝗰 𝘀𝘆𝘀𝘁𝗲𝗺𝘀! 💥 Main logic in ~1000 LoC 🧑‍💻 Agent writes its actions in code! LLMs are much better at writing code than current standard of writing JSON => higher perf 🌍 Any LLM support (h/t LiteLLM) 🛡️ Secure code exec (h/t E2B)

submitted 56 days ago • 4 comments

With the new OpenAI o3 moving performance from 5% up to 25% on FrontierMath, it’s time to push open-source models upwards! We're super happy to release FineMath, the best open math dataset yet. A strong baseline to start training your own models. Find it in the trending section of HuggingFace ;)

submitted 66 days ago • 0 comments

what was this thing btw? "Moreover, ARC-AGI-1 is now saturating – besides o3's new score, the fact is that a large ensemble of low-compute Kaggle solutions can now score 81% on the private eval" big ensemble of heuristics?

submitted 67 days ago • 4 comments

challenge!

submitted 67 days ago • 1 comment

Want to grasp test time compute, the secret sauce behind recent AI breakthroughs? We've open-sourced the method - perfect holiday reading to understand what's powering the next wave of AI model development. 🧠✨ huggingface.co/spaces/Huggi...

submitted 68 days ago • 1 comment

Introducing 📐FineMath: the best open math pre-training dataset with 50B+ tokens! Math remains challenging for LLMs and by training on FineMath we see considerable gains over other math datasets, especially on GSM8K and MATH. 🤗 huggingface.co/datasets/Hug... Here’s a breakdown 🧵

submitted 69 days ago • 2 comments

2025 will be the year of AI for science: leveraging all the things we've recently learned training AI models for 1000x impact in science. And this will need data! More details: huggingface.co/blog/lemater... Thread: ...

submitted 75 days ago • 2 comments

Big News in AI4Science! ✨ We are thrilled to launch LeMaterial, an open-source project in collaboration with @hf.co to accelerate materials discovery ⚛️🤗 Discover LeMat-Bulk: a 6.7M-entry dataset standardizing and unifying Materials Project, Alexandria and OQMD

submitted 77 days ago • 2 comments

The Open LLM Leaderboard got a new front page for Christmas! Check it out at huggingface.co/spaces/open-...

submitted 78 days ago • 2 comments

When you’ve finished your day of emails

submitted 78 days ago • 2 comments

Spanish, Filipino, Amharic, French, German, Basque, Catalan, Galician, Guarani, Telugu, Italian, Pashto, Romanian, Tamil, Urdu, Danish... and many more! All included in the FineWeb2 Community Annotation Sprint! 🔥 💫 Join to build an impactful dataset for your language!

submitted 79 days ago • 1 comment

The FineWeb team is happy to finally release "FineWeb2" 🥂🥳 FineWeb 2 extends the data-driven approach to pre-training dataset design that was introduced in FineWeb 1 to now cover 1893 languages/scripts. Details: huggingface.co/datasets/Hug... A detailed open-science tech report is coming soon.

submitted 81 days ago • 3 comments

Announcing 🥂 FineWeb2: A sparkling update with 1000s of 🗣️languages. We applied the same data-driven approach that led to SOTA English performance in🍷 FineWeb to thousands of languages. 🥂 FineWeb2 has 8TB of compressed text data and outperforms other datasets.

submitted 81 days ago • 1 comment

Four new visualisations of the rise of open-source AI models in 2024 added in our daily nuggets! - explore how various AI tasks have been growing - how community likes connect AI models together - the geography of model creators and followers Explore them here: huggingface.co/spaces/huggi...

submitted 80 days ago • 1 comment

Announcing Global-MMLU - an improved MMLU Open dataset with evaluation coverage across 42 languages. The result of months of work with the goal of advancing Multilingual LLM evaluation. Built together with the community and amazing collaborators at Cohere4AI, MILA, MIT, and many more.

submitted 83 days ago • 4 comments