simon.fedi.simonwillison.net.ap.brid.gy
Open source developer building tools to help journalists, archivists, librarians and others analyze, explore and publish their data. https://datasette.io […] [bridged from https://fedi.simonwillison.net/@simon on the fediverse by https://fed.brid.gy/ ]
985 posts 8,657 followers 2 following

I'm now calling Claude Code "honey badger" on account of its voracious appetite for crunching through code and tokens looking for the right thing to fix https://simonwillison.net/2025/May/23/honey-badger/

Once again, if your LLM system combines access to private data, exposure to malicious instructions and the ability to exfiltrate information (through tool use or through rendering links and images) you have a nasty security hole. This time, GitLab […]
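The three conditions named in the post above can be written down as a simple check. This is only an illustrative sketch: the `Capabilities` class, its field names and `has_exfiltration_risk` are hypothetical and do not come from any real library or from the linked write-up.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Capabilities:
    """Hypothetical description of what an LLM system is able to do."""
    private_data: bool     # can read data the attacker should not see
    untrusted_input: bool  # may be exposed to malicious instructions
    can_exfiltrate: bool   # tool use, or rendering of links and images


def has_exfiltration_risk(caps: Capabilities) -> bool:
    """Return True only when all three conditions hold at the same time."""
    return caps.private_data and caps.untrusted_input and caps.can_exfiltrate


# Removing any one of the three capabilities breaks the combination.
risky = Capabilities(private_data=True, untrusted_input=True, can_exfiltrate=True)
safer = Capabilities(private_data=True, untrusted_input=True, can_exfiltrate=False)
print(has_exfiltration_risk(risky), has_exfiltration_risk(safer))
```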

Started a live blog for today's Claude 4 release at Code with Claude https://simonwillison.net/2025/May/22/code-with-claude-live-blog/

If your library doesn't have any documentation, it can't have any bugs https://simonwillison.net/2025/May/22/no-docs-no-bugs/

Today's other big new model is Devstral, an Apache 2.0 licensed LLM that specializes in code and seems very good from my initial experiments. It's a 14GB download from Ollama, notes here: https://simonwillison.net/2025/May/21/devstral/

I got access to Gemini Diffusion, Google's first diffusion LLM, and the thing is absurdly fast - it ran at 857 tokens/second and built me a prototype chat interface in just a couple of seconds, video here: https://simonwillison.net/2025/May/21/gemini-diffusion/

ChatGPT's new dossier-from-your-chats feature is a huge change to how it works, and as a power user who tries to control all of the model's input I don't like it at all. “30 messages are good interaction quality (25%); 9 messages are bad interaction quality (7%)” […]

There was a pelican riding a bicycle in today's Google I/O keynote! https://simonwillison.net/2025/May/20/google-io-pelican/

I think the hardest problem in computer science may be "if I add 'PyMuPDF' (AGPL) to my open source Python library as a dependency in setup.py/pyproject.toml does that mean my entire library needs to be licensed AGPL? And if it's a plugin (loaded via pluggy/entry points) does that impact the […]

I built a new LLM plugin that can turn a PDF into an image-per-page for feeding into vision models, and in testing it found that GPT-4.1 mini hallucinates WILDLY if you feed it a blank white rectangle followed by a blank black rectangle https://simonwillison.net/2025/May/18/llm-pdf-to-images/

Qwen 2.5 VL is out on Ollama - I tried the 6GB one and it worked for describing a photo but gave me "It looks like the image you provided is a jumbled and distorted text" when I tried using it for OCR. Anyone had any luck with it for that? https://simonwillison.net/2025/May/18/qwen25vl-in-ollama/

If you're at #PyConUS I would LOVE to talk to you about my various projects - I'll be stood next to my poster in Hall A tomorrow from 10am to 1pm, please drop by and say hi. My poster is the one with the "teenage bedroom" aesthetic, I decided to fill […] [Original post on fedi.simonwillison.net]

Here's the full workshop handout plus annotated slides from "Building software on top of Large Language Models", a three hour tutorial I presented yesterday at PyCon US #PyConUS https://simonwillison.net/2025/May/15/building-on-llms/

I've been working on this for a while...
llm --functions '
def multiply(x: int, y: int) -> int:
    """Multiply two numbers."""
    return x * y
' 'what is 34234 * 213345' -m o4-mini
https://simonwillison.net/2025/May/14/llm-adds-support-for-tools/
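As a rough illustration of what a tool call like the one above boils down to, here is a minimal sketch in plain Python. The `multiply` function is taken from the post; the `TOOLS` registry, the `run_tool_call` helper and the JSON request shape are assumptions made up for this example and are not the LLM library's actual internals.

```python
import json


def multiply(x: int, y: int) -> int:
    """Multiply two numbers."""
    return x * y


# Hypothetical registry mapping tool names to Python callables.
TOOLS = {"multiply": multiply}


def run_tool_call(request_json: str) -> int:
    """Look up the requested tool by name and call it with the given arguments."""
    request = json.loads(request_json)
    func = TOOLS[request["name"]]
    return func(**request["arguments"])


# Assumed shape of a request a model might emit for the prompt in the post.
result = run_tool_call(
    '{"name": "multiply", "arguments": {"x": 34234, "y": 213345}}'
)
print(result)  # 7303652730
```

The point of handing the arithmetic to a function is that the model only has to pick the tool and its arguments; the exact product comes from Python rather than from the model's own token-by-token guess.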

I asked o4-mini-high about OpenAI's core engineering principles and it either leaked or hallucinated the URL of OpenAI's internal engineering handbook https://simonwillison.net/2025/May/13/launching-chatgpt-images/

Made some notes on how Cursor works under the hood based on their security documentation - it turns out an organization's list of subprocessors offers a loose form of "view source" for their infrastructure! https://simonwillison.net/2025/May/11/cursor-security/

llama.cpp shipped new support for vision models this morning, including macOS binaries (albeit quarantined so you have to take extra steps to run them) that let you run vision models in a terminal or as a localhost web UI. My notes on how to get it running on a Mac […]

Gemini 2.5 now applies the 75% cached token discount automatically - previously you had to manually configure it. Potentially big cost savings here for applications that run prompts against the same long context, or continue existing conversations […]
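To get a feel for what a 75% discount on cached tokens could mean, here is a small worked calculation. Only the 75% figure comes from the post; the `input_cost` helper, the token counts and the price of $1.00 per million input tokens are hypothetical values chosen to keep the arithmetic easy, not real Gemini pricing.

```python
def input_cost(
    total_tokens: int,
    cached_tokens: int,
    price_per_million: float,
    cache_discount: float = 0.75,
) -> float:
    """Input cost in dollars when cached tokens are billed at a discount."""
    if not 0 <= cached_tokens <= total_tokens:
        raise ValueError("cached_tokens must be between 0 and total_tokens")
    uncached_tokens = total_tokens - cached_tokens
    cached_price = price_per_million * (1 - cache_discount)
    total = uncached_tokens * price_per_million + cached_tokens * cached_price
    return total / 1_000_000


# Hypothetical prompt: 100,000 input tokens, 90,000 of them served from cache.
without_cache = input_cost(100_000, 0, price_per_million=1.00)
with_cache = input_cost(100_000, 90_000, price_per_million=1.00)
print(f"without cache: ${without_cache:.4f}, with cache: ${with_cache:.4f}")
```

Under these assumed numbers the cached request costs about a third of the uncached one, which is why repeated prompts against the same long context stand to benefit most.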

Some notes on the gemini-2.0-flash-preview-image-generation model that's now available via the Gemini API https://simonwillison.net/2025/May/7/gemini-images-preview/

Published some notes on Microsoft's phi4-reasoning model, an 11GB download (via Ollama) which may well overthink things... it produced 56 sentences of reasoning output in response to my prompt of "hi" https://simonwillison.net/2025/May/6/phi-4-reasoning/

New Gemini 2.5 Pro preview model today, released in advance of Google I/O. Google claim it is state of the art in both frontend code generation and video understanding. https://simonwillison.net/2025/May/6/gemini-25-pro-preview/

New release of LLM, accompanied by a new plugin - you can now use llm-video-frames to turn a video file into a sequence of JPEGs and feed those into a long-context vision model like GPT-4.1-mini https://simonwillison.net/2025/May/5/llm-video-frames/