Simon Willison's Weblog
- Author
- Simon Willison
- Public lists
-
Featured
- Fetched
GitHub MCP Exploited: Accessing private repositories via MCP
Build AI agents with the Mistral Agents API
Large Language Models can run tools in your terminal with LLM 0.26
At Amazon, Some Coders Say Their Jobs Have Begun to Resemble Warehouse Work
llm-llama-server 0.2
Highlights from the Claude 4 system prompt
Subscribe to my sponsors-only monthly newsletter.
System Card: Claude Opus 4 & Claude Sonnet 4
How I used o3 to find CVE-2025-37899, a remote zeroday vulnerability in the Linux kernel’s SMB implementation
f2
Honey badger
I'm helping make some changes to a large, complex and very unfamiliar to me WordPress site. It's a perfect opportunity to try out Claude Code running against the new Claude 4 models.
It's going extremely well. So far Claude has helped get MySQL working on an older laptop (fixing some inscrutable Homebrew errors), disabled a CAPTCHA plugin that didn't work on localhost, toggled visible warnings on and off several times and figured out which CSS file to modify in the theme that the site is using. It even took a reasonable stab at making the site responsive on mobile!
I'm now calling Claude Code honey badger on account of its voracious appetite for crunching through code (and tokens) looking for the right thing to fix.
I got ChatGPT to make me some fan art:

Tags: anthropic, claude, wordpress, ai, llms, ai-assisted-programming, generative-ai, homebrew, claude-4
Remote Prompt Injection in GitLab Duo Leads to Source Code Theft
Agents are models using tools in a loop
I was going slightly spare at the fact that every talk at this Anthropic developer conference has used the word "agents" dozens of times, but nobody ever stopped to provide a useful definition.
I'm now in the "Prompting for Agents" workshop and Anthropic's Hannah Moran finally broke the trend by saying that at Anthropic:
Agents are models using tools in a loop
I can live with that! I'm glad someone finally said it out loud.
Tags: anthropic, generative-ai, ai-agents, ai, llms
Updated Anthropic model comparison table
llm-anthropic 0.16
Live blog: Claude 4 launch at Code with Claude
I'm at Anthropic's Code with Claude event, where they are launching Claude 4. I'll be live blogging the keynote here.
Tags: llm-release, liveblogging, anthropic, claude, generative-ai, ai, llms, pelican-riding-a-bicycle, claude-4
No docs, no bugs
If your library doesn't have any documentation, it can't have any bugs.
Documentation specifies what your code is supposed to do. Your tests specify what it actually does.
Bugs exist when your test-enforced implementation fails to match the behavior described in your documentation. Without documentation a bug is just undefined behavior.
If you aim to follow semantic versioning you bump your major version when you release a backwards incompatible change. Such changes cannot exist if your code is not comprehensively documented!
Inspired by a half-remembered conversation I had with Tom Insam many years ago.
Tags: testing, semantic-versioning, documentation
Devstral
Gemini Diffusion
Chicago Sun-Times Prints AI-Generated Summer Reading List With Books That Don't Exist
I really don't like ChatGPT's new memory dossier
We did the math on AI’s energy footprint. Here’s the story you haven’t heard.
Gemini 2.5: Our most intelligent models are getting even better
Google I/O Pelican
Tucked into today's Google I/O keynote, a blink-and-you'll miss it moment:
The pelican in the keynote was created by Alexander Chen. Here's the code they wrote with the help of Gemini, which uses p5.js to power the animation.
Tags: pelican-riding-a-bicycle, google-io, google
cityofaustin/atd-data-tech issues
cityofaustin/atd-data-tech issues
I stumbled across this today while looking for interesting frequently updated data sources from local governments. It turns out the City of Austin's Transportation Data & Technology Services department run everything out of a public GitHub issues instance, which currently has 20,225 closed and 2,002 open issues. They also publish an exported copy of the issues data through the data.austintexas.gov open data portal.Tags: open-data, github, github-issues