Simon Willison's Weblog
- Author
- Simon Willison
- Public lists
-
Featured
- Fetched
Why NetNewsWire Is Not a Web App
Python 3.14
Retraction
I retracted this article after confirming that, while Gemini 2.5 Computer Use did try to solve Google's own CAPTCHA, it was actually Browserbase that successfully performed the solve. I'll update this with more details shortly.
Tags: gemini
Quoting Thomas Klausner
For quite some I wanted to write a small static image gallery so I can share my pictures with friends and family. Of course there are a gazillion tools like this, but, well, sometimes I just want to roll my own. [...]
I used the old, well tested technique I call brain coding, where you start with an empty vim buffer and type some code (Perl, HTML, CSS) until you're happy with the result. It helps to think a bit (aka use your brain) during this process.
— Thomas Klausner, coining "brain coding"
Tags: vibe-coding, definitions
Vibe engineering
Deloitte to pay money back to Albanese government after using AI in $440,000 report
gpt-image-1-mini
a system that can do work independently on behalf of the user
I've settled on agents as meaning "LLMs calling tools in a loop to achieve a goal" but OpenAI continue to muddy the waters with much more vague definitions. Swyx spotted this one in the press pack OpenAI sent out for their DevDay announcements today:
How does OpenAl define an "agent"? An Al agent is a system that can do work independently on behalf of the user.
Adding this one to my collection.
Tags: ai-agents, openai, agent-definitions, swyx
GPT-5 pro
OpenAI DevDay 2025 live blog
I'm at OpenAI DevDay in Fort Mason, San Francisco today. As I did last year, I'm going to be live blogging the announcements from the kenote. Unlike last year, this year there's a livestream.
Tags: liveblogging, ai, openai, generative-ai, llms
Quoting Tim Berners-Lee
I believed that giving users such a simple way to navigate the internet would unlock creativity and collaboration on a global scale. If you could put anything on it, then after a while, it would have everything on it.
But for the web to have everything on it, everyone had to be able to use it, and want to do so. This was already asking a lot. I couldn’t also ask that they pay for each search or upload they made. In order to succeed, therefore, it would have to be free. That’s why, in 1993, I convinced my Cern managers to donate the intellectual property of the world wide web, putting it into the public domain. We gave the web away to everyone.
— Tim Berners-Lee, Why I gave the world wide web away for free
Tags: web, tim-berners-lee, computer-history
Bad bots
Embracing the parallel coding agent lifestyle
Let the LLM Write the Prompts: An Intro to DSPy in Compound Al Pipelines
Litestream v0.5.0 is Here
Sora 2 prompt injection
Daniel Stenberg's note on AI assisted curl bug reports
Quoting Nadia Eghbal
When attention is being appropriated, producers need to weigh the costs and benefits of the transaction. To assess whether the appropriation of attention is net-positive, it’s useful to distinguish between extractive and non-extractive contributions. Extractive contributions are those where the marginal cost of reviewing and merging that contribution is greater than the marginal benefit to the project’s producers. In the case of a code contribution, it might be a pull request that’s too complex or unwieldy to review, given the potential upside
— Nadia Eghbal, Working in Public, via the draft LLVM AI tools policy
Tags: ai-ethics, open-source, vibe-coding, ai-assisted-programming, generative-ai, ai, llms, definitions
Two more Chinese pelicans
aavetis/PRarena
September monthly sponsors newsletter
I just sent out the September edition of my sponsors-only monthly newsletter. If you are a sponsor (or if you start a sponsorship now) you can access a copy here. The sections this month are:
- Best model for code? GPT-5-Codex... then Claude 4.5 Sonnet
- I've grudgingly accepted a definition for "agent"
- GPT-5 Research Goblin and Google AI Mode
- Claude has Code Interpreter now
- The lethal trifecta in the Economist
- Other significant model releases
- Notable AI success stories
- Video models are zero-shot learners and reasoners
- Tools I'm using at the moment
- Other bits and pieces
Here's a copy of the August newsletter as a preview of what you'll get. Pay $10/month to stay a month ahead of the free copy!
Tags: newsletter
Sora 2
Designing agentic loops
Claude Sonnet 4.5 is probably the "best coding model in the world" (at least for now)
Armin Ronacher: 90%
Quoting Scott Aaronson
Quoting Nick Turley
We’ve seen the strong reactions to 4o responses and want to explain what is happening.
We’ve started testing a new safety routing system in ChatGPT.
As we previously mentioned, when conversations touch on sensitive and emotional topics the system may switch mid-chat to a reasoning model or GPT-5 designed to handle these contexts with extra care. This is similar to how we route conversations that require extra thinking to our reasoning models; our goal is to always deliver answers aligned with our Model Spec.
Routing happens on a per-message basis; switching from the default model happens on a temporary basis. ChatGPT will tell you which model is active when asked.
— Nick Turley, Head of ChatGPT, OpenAI
Tags: generative-ai, openai, chatgpt, ai, llms, nick-turley