Simon Willison's Weblog
- Author
- Simon Willison
- Public lists
-
Featured
- Fetched
The Five Levels: from Spicy Autocomplete to the Dark Factory
One Human + One Agent = One Browser From Scratch
Kimi K2.5: Visual Agentic Intelligence
Tips for getting coding agents to write good Python tests
ChatGPT Containers can now run bash, pip/npm install packages, and download files
the browser is the sandbox
Kākāpō Cam: Rakiura live stream
Don't "Trust the Process"
Quoting Jasmine Sun
If you tell a friend they can now instantly create any app, they’ll probably say “Cool! Now I need to think of an idea.” Then they will forget about it, and never build a thing. The problem is not that your friend is horribly uncreative. It’s that most people’s problems are not software-shaped, and most won’t notice even when they are. [...]
Programmers are trained to see everything as a software-shaped problem: if you do a task three times, you should probably automate it with a script. Rename every IMG_*.jpg file from the last week to hawaii2025_*.jpg, they tell their terminal, while the rest of us painfully click and copy-paste. We are blind to the solutions we were never taught to see, asking for faster horses and never dreaming of cars.
Tags: vibe-coding, coding-agents, claude-code, generative-ai, ai, llms
Wilson Lin on FastRender: a browser built by thousands of parallel agents
Quoting Theia Vogel
[...] i was too busy with work to read anything, so i asked chatgpt to summarize some books on state formation, and it suggested circumscription theory. there was already the natural boundary of my computer hemming the towns in, and town mayors played the role of big men to drive conflict. so i just needed a way for them to fight. i slightly tweaked the allocation of claude max accounts to the towns from a demand-based to a fixed allocation system. towns would each get a fixed amount of tokens to start, but i added a soldier role that could attack and defend in raids to steal tokens from other towns. [...]
— Theia Vogel, Gas Town fan fiction
Tags: parallel-agents, llms, ai, generative-ai
SSH has no Host header
Quoting Chris Lloyd
Most people's mental model of Claude Code is that "it's just a TUI" but it should really be closer to "a small game engine".
For each frame our pipeline constructs a scene graph with React then:
-> layout elements
-> rasterize them to a 2d screen
-> diff that against the previous screen
-> finally use the diff to generate ANSI sequences to drawWe have a ~16ms frame budget so we have roughly ~5ms to go from the React scene graph to ANSI written.
— Chris Lloyd, Claude Code team at Anthropic
Tags: react, claude-code
Qwen3-TTS Family is Now Open Sourced: Voice Design, Clone, and Generation
Quoting Thariq Shihipar
Most people's mental model of Claude Code is that "it's just a TUI" but it should really be closer to "a small game engine".
For each frame our pipeline constructs a scene graph with React then
-> layouts elements
-> rasterizes them to a 2d screen
-> diffs that against the previous screen
-> finally uses the diff to generate ANSI sequences to drawWe have a ~16ms frame budget so we have roughly ~5ms to go from the React scene graph to ANSI written.
Tags: react, claude-code
Claude's new constitution
Electricity use of AI coding agents
Electricity use of AI coding agents
Previous work estimating the energy and water cost of LLMs has generally focused on the cost per prompt using a consumer-level system such as ChatGPT.Simon P. Couch notes that coding agents such as Claude Code use way more tokens in response to tasks, often burning through many thousands of tokens of many tool calls.
As a heavy Claude Code user, Simon estimates his own usage at the equivalent of 4,400 "typical queries" to an LLM, for an equivalent of around $15-$20 in daily API token spend. He figures that to be about the same as running a dishwasher once or the daily energy used by a domestic refrigerator.
Via Hacker News
Tags: ai, generative-ai, llms, ai-ethics, ai-energy-usage, coding-agents, claude-code
Giving University Exams in the Age of Chatbots
Giving University Exams in the Age of Chatbots
Detailed and thoughtful description of an open-book and open-chatbot exam run by Ploum at École Polytechnique de Louvain for an "Open Source Strategies" class.Students were told they could use chatbots during the exam but they had to announce their intention to do so in advance, share their prompts and take full accountability for any mistakes they made.
Only 3 out of 60 students chose to use chatbots. Ploum surveyed half of the class to help understand their motivations.
Via lobste.rs
Tags: education, ai, generative-ai, llms, ai-ethics
jordanhubbard/nanolang
Scaling long-running autonomous coding
FLUX.2-klein-4B Pure C Implementation
Quoting Jeremy Daer
Our approach to advertising and expanding access to ChatGPT
Open Responses
The Design & Implementation of Sprites
Quoting Boaz Barak, Gabriel Wu, Jeremy Chen and Manas Joglekar
Claude Cowork Exfiltrates Files
Claude Cowork Exfiltrates Files
Claude Cowork defaults to allowing outbound HTTP traffic to only a specific list of domains, to help protect the user against prompt injection attacks that exfiltrate their data.Prompt Armor found a creative workaround: Anthropic's API domain is on that list, so they constructed an attack that includes an attacker's own Anthropic API key and has the agent upload any files it can see to the https://api.anthropic.com/v1/files endpoint, allowing the attacker to retrieve their content later.
Via Hacker News
Tags: security, ai, prompt-injection, generative-ai, llms, anthropic, exfiltration-attacks, ai-agents, claude-code, lethal-trifecta
Anthropic invests $1.5 million in the Python Software Foundation and open source security
TIL from taking Neon I at the Crucible
TIL from taking Neon I at the Crucible
Things I learned about making neon signs after a week long intensive evening class at the Crucible in Oakland.