dataschool.io
Teaching Data Science 🤖 with Python 🐍 since 2014 Courses: https://courses.dataschool.io Newsletter: https://tuesday.tips Videos: https://www.youtube.com/@dataschool Living in beautiful Asheville, North Carolina 🌲
138 posts 934 followers 42 following

My thoughts on the current state of AI progress and the most important developments in 2025: www.dataschool.io/ai-progress-...

Security researcher finds zero-day vulnerability in Linux kernel using o3 + @simonwillison.net's llm tool sean.heelan.io/2025/05/22/h...

Claude 4 API is now available!

🙋‍♀️ Curious about Retrieval Augmented Generation (RAG)? 👨‍🏫 Allow me to explain! Here's the video outline: 0:00 RAG definition 0:14 Problem that RAG solves 0:49 RAG part 1: Indexing 1:43 RAG part 2: Retrieval and generation 2:38 How RAG prevents hallucinations
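
The two RAG parts named in the outline above (indexing, then retrieval and generation) can be sketched in a few lines. This is a toy illustration, not the approach from the video: the documents are made up, `embed` is a bag-of-words stand-in for a real embedding model, and `retrieve` and `build_prompt` are hypothetical helper names. The final prompt would be sent to whichever LLM you use.

```python
import math
import re
from collections import Counter

# Hypothetical mini knowledge base; a real system would index chunks of real documents.
DOCUMENTS = [
    "The refund window is 30 days from the date of purchase.",
    "Support is available by email on weekdays.",
    "Course videos can be downloaded for offline viewing.",
]

def embed(text):
    """Toy 'embedding': a bag-of-words count vector (assumption: stands in for a real model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))

def cosine(a, b):
    """Cosine similarity between two count vectors."""
    dot = sum(a[word] * b[word] for word in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

# Part 1, indexing: embed every document once and store the vectors.
INDEX = [(doc, embed(doc)) for doc in DOCUMENTS]

def retrieve(question, k=1):
    """Part 2a, retrieval: return the k documents most similar to the question."""
    q = embed(question)
    ranked = sorted(INDEX, key=lambda item: cosine(q, item[1]), reverse=True)
    return [doc for doc, _ in ranked[:k]]

def build_prompt(question):
    """Part 2b, generation: put the retrieved text in the prompt so the answer is grounded in it."""
    context = "\n".join(retrieve(question))
    return f"Answer using only this context:\n{context}\n\nQuestion: {question}"

prompt = build_prompt("How many days is the refund window?")
print(prompt)
```

Because the model is told to answer only from the retrieved context, it has less room to invent facts, which is the usual argument for how RAG reduces hallucinations.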

🚀 Just launched my new course: Build #AI agents with #Python 🏷 Save $50 through Friday: courses.dataschool.io/build-ai-age... 🙏 Please share with a friend... thank you!

Published some notes on Microsoft's phi4-reasoning model, an 11GB download (via Ollama) which may well overthink things... it produced 56 sentences of reasoning output in response to my prompt of "hi" simonwillison.net/2025/May/6/p...

Claude finally integrated web search into its results... But with LangChain & LangGraph, you can build a chatbot that integrates web search into ANY model you like! You'll learn how to do that (and much more) in my new AI course... Sign up for EARLY ACCESS: 👉 dataschool.kit.com/agents 👈

🗒️ Do you need to prepare an ML model, and are you working with text and strings? Skrub provides four encoders to convert strings into numerical features. 🤗 models included! Which is best? Check out our blog post to find out 👀 skrub-data.org/skrub-materi...

Just finished recording my new AI course 😅 Sign up for early access: dataschool.kit.com/agents

Lots of good advice and best practices for missing value imputation in the paper! I now have a much deeper appreciation for Data School's course and regard it as the best scikit-learn course. Master Machine Learning with scikit-learn: courses.dataschool.io/master-machi...

In honor of March Madness 🏀, I've got a new blog post: www.dataschool.io/pandas-strea... Learn how to identify & analyze scoring streaks using pandas operations: - shift() - cumsum() - boolean math - groupby()
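
As a rough idea of how those four operations combine, here is a minimal sketch. It is not the code from the blog post: the play-by-play data and the column names (`team`, `points`, `streak_id`) are invented for illustration.

```python
import pandas as pd

# Hypothetical play-by-play data: which team scored on each scoring play.
plays = pd.DataFrame({
    "team": ["A", "A", "B", "A", "A", "A", "B", "B"],
    "points": [2, 3, 2, 2, 2, 3, 1, 2],
})

# shift() + boolean math: True wherever the scoring team differs from the previous row.
new_streak = plays["team"] != plays["team"].shift()

# cumsum() turns those flags into a streak ID that increments at every change of team.
plays["streak_id"] = new_streak.cumsum()

# groupby() summarizes each streak: who scored, how many plays, how many points.
streaks = plays.groupby("streak_id").agg(
    team=("team", "first"),
    plays=("points", "size"),
    points=("points", "sum"),
)

# The streak with the most unanswered points.
longest = streaks.loc[streaks["points"].idxmax()]
print(streaks)
print(longest)
```

The key trick is that comparing a column with its shifted copy marks the start of each run, and a cumulative sum of those marks gives every run its own label to group on.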

"To Type or Not to Type?" is an excellent introduction (and reference guide) to type hints in Python: blog.jonathanchun.com/2025/02/16/t...

"Some people today are discouraging others from learning programming on the grounds AI will automate it. This advice will be seen as some of the worst career advice ever given." -- Andrew Ng, legendary AI researcher Source: www.deeplearning.ai/the-batch/is...

Here's the table of contents for my lengthy new piece on how I use LLMs to help me write code simonwillison.net/2025/Mar/11/...

These are @simonwillison.net's current favorite AI models for Python programming: - Claude 3.7 Sonnet with thinking turned on - OpenAI's o3-mini-high - GPT-4o with Code Interpreter Source: simonwillison.net/2025/Mar/2/h...

19 professionals (in a variety of fields) evaluated OpenAI's Deep Research vs Google's Deep Research. OpenAI was the clear winner 🏆 Neat study by @binarybits.bsky.social, read more here: www.understandingai.org/p/these-expe...

"The next time someone predicts that we will soon have AI that can do anything 'a human can do', ask them what they think a human can do. You may find that they are only considering capabilities that are easily measured." Source: amistrongeryet.substack.com/p/where-ai-f...

A couple months ago @dataschool.io wrote about a tool he uses to chat with different LLM models without paying a monthly subscription to all of them. The tool is called Typing Mind and I decided to pay $30 for lifetime access. It was well worth it. Kevin's post 👇 www.dataschool.io/save-money-o...

Want to keep up with the latest developments in AI? Check out the 7 newsletters I personally read & recommend for AI news, insights, and analysis: 👉 www.dataschool.io/best-ai-news... 👈 Includes: @simonwillison.net @emollick.bsky.social @sebastianraschka.com @binarybits.bsky.social

@gruber.foo Do your friends & family know you invented Markdown? Because it's pretty cool that something you made is used so heavily by LLMs, which are arguably the most important innovation in technology today.

Here's my personal guide to Bluesky happiness 🌈 1. Be very selective about who I follow (42 people at the moment) 2. Read everything in the "Following" feed, and nothing else 3. Trust that the people I follow will surface other good people to follow 4. Unfollow anyone who no longer makes the cut

Wondering about the differences between "Jupyter", "Jupyter Notebook", "JupyterLab", "IPython", "Colab", and other related terms? I'll explain these terms (and more) in 5 minutes: ▶️ www.youtube.com/watch?v=TDlG...

Economist Tyler Cowen on OpenAI's Deep Research: "I think of the quality as comparable to having a good PhD-level research assistant, and sending that person away with a task for a week or two, or maybe more." marginalrevolution.com/marginalrevo...

Comparing: - Software Developers - ML Engineers - LLM Developers - Prompt Engineers Read here: www.louisbouchard.ai/llm-develope... Image & article by Louis-François Bouchard

Got a Python-based SaaS idea brewing? Let’s team up. I’m forming a small mastermind to talk tech, marketing, and everything in between. We’ll meet twice a month, hang in a private Discord, and it’s all 100% free. Interested? #Python #Startups Join here: docs.google.com/forms/d/e/1F...

Economist Tyler Cowen on o1 pro: "I think only a relatively small number of humans in the world can give better answers to what I want to know." "Yes, it does cost $200 a month. It is worth that sum to converse with the smartest entity yet devised." marginalrevolution.com/marginalrevo...

My thoughts on whether "coding is dead" in the age of AI 👉 www.dataschool.io/ai-vs-coding/ 👈

Wrangling string columns for machine learning, the new StringEncoder in @skrub-data.bsky.social gives such a good compute/prediction performance tradeoff. It's mostly just a bunch of simple tricks, but with well-chosen defaults. This is what we aim for in skrub skrub-data.org/stable/refer...

We totally care if you like this post about ‘sarcasm’ and ‘irony.’ These words are sometimes treated as if they mean the same thing, but they have distinct meanings that are interesting and useful to understand. 🧵⬇️

I asked GPT-4o to write a New Yorker-style caption for this real New Yorker cartoon. Here was my favorite: "Just wait until the Series A investors hear we’ve gone fully remote." And the real caption: "The wind in your hair, the scent of nature all around--this is what data entry was meant to be."