Simon Willison's Weblog

Author
Simon Willison
An Introduction to Google’s Approach to AI Agent Security

Here's another new paper on AI agent security: An Introduction to Google’s Approach to AI Agent Security, by Santiago Díaz, Christoph Kern, and Kara Olive. (I wrote about a different recent paper, Design Patterns for Securing LLM Agents against Prompt Injections just a few d...

Anthropic: How we built our multi-agent research system

OK, I'm sold on multi-agent LLM systems now. I've been pretty skeptical of these until recently: why make your life more complicated by running multiple different prompts in parallel when you can usually get something u...

llm-fragments-youtube

Excellent new LLM plugin by Agustin Bacigalup which lets you use the subtitles of any YouTube video as a fragment for running prompts against. I tried it out like this:

    llm install llm-fragments-youtube
    llm -f youtube:dQw4w9WgXcQ \
      'summary of people ...

Quoting Google Cloud outage incident report

Google Cloud, Google Workspace and Google Security Operations products experienced increased 503 errors in external API requests, impacting customers. [...] On May 29, 2025, a new feature was added to Service Control for additional quota policy checks. This code change and ...

The Wikimedia Research Newsletter

Speaking of summarizing research papers, I just learned about this newsletter and it is an absolute gold mine: The Wikimedia Research Newsletter (WRN) covers research of relevance to the Wikimedia community. It has been appearing generally ...

Quoting Andrew Ng

There’s a new breed of GenAI Application Engineers who can build more-powerful applications faster than was possible before, thanks to generative AI. Individuals who can play this role are highly sought-after by businesses, but the job description is still coming into focus. [...]

Skilled GenAI Application Engineers meet two primary criteria: (i) They are able to use the new AI building blocks to quickly build powerful applications. (ii) They are able to use AI assistance to carry out rapid engineering, building software systems in dramatically less time than was possible before. In addition, good product/design instincts are a significant bonus.

Andrew Ng

Tags: careers, ai-assisted-programming, generative-ai, ai, llms, andrew-ng

Blogging about papers

My post this morning about Design Patterns for Securing LLM Agents against Prompt Injections is an example of a blogging format I'd love to see more of: informal but informed commentary on academic papers. Academic papers are generally hard to read. Sadly that's almost a req...

Design Patterns for Securing LLM Agents against Prompt Injections

This new paper by 11 authors from organizations including IBM, Invariant Labs, ETH Zurich, Google and Microsoft is an excellent addition to the literature on prompt injection and LLM security. In this work, we describe a number of design patterns for LLM agents that signifi...

It's this blog's 23rd birthday

It's this blog's 23rd birthday today!

On June 12th 2022 I celebrated Twenty years of my blog with a big post full of highlights. Looking back now I'm amused to notice that my 20th birthday post came within two weeks of my earliest writing about LLMs: A Datasette tutorial written by GPT-3 and How to use the GPT-3 language model.

My generative-ai tag has reached 1,184 posts now.

I really do feel like blogging is onto its second wind. The amount of influence you can have on the world by consistently blogging about a subject is just as high today as it was back in the 2000s when blogging first started.

The best time to start a blog may have been twenty years ago, but the second best time to start a blog is today.

Tags: generative-ai, blogging

‘How come I can’t breathe?’: Musk’s data company draws a backlash in Memphis

The biggest environmental scandal in AI right now should be the xAI data center in Memphis, which has been running for nearly a year on 35 methane gas turbines under a "temporary" basis: The turbin...

Agentic Coding Recommendations

There's a ton of actionable advice on using Claude Code in this new piece from Armin Ronacher. He's getting excellent results from Go, especially having invested a bunch of work in making the various tools (linters, tests, development servers etc) as accessible as possible through documenting them in a Makefile.

Armin also recently shared a half hour YouTube video in which he worked with Claude Code to resolve two medium complexity issues in his minijinja Rust templating library, resulting in PR #805 and PR #804.

Via @mitsuhiko.at

Tags: go, ai, llms, rust, ai-assisted-programming, coding-agents, generative-ai, armin-ronacher, anthropic, claude, claude-code

Disney and Universal Sue AI Company Midjourney for Copyright Infringement

This is a big one. It's very easy to demonstrate that Midjourney will output images of copyright protected characters (like Darth Vader or Yoda) based on a short text prompt.

There are already dozens of copyright lawsuits against AI companies winding through the US court system—including a class action lawsuit visual artists brought against Midjourney in 2023—but this is the first time major Hollywood studios have jumped into the fray.

Here's the lawsuit on Document Cloud - 110 pages, most of which are examples of supposedly infringing images.

101. In response to the prompt, "Sulley, animated," Midjourney accessed the data about Disney's Copyrighted Works that is stored by the Image Service and then reproduced, publicly displayed, and made available for download an image output that copies Disney's Sulley character, as shown in this screenshot:

[Image: Midjourney Output (clearly Sulley from Monsters, Inc.). Disney's Copyrighted Character(s): Sulley from Monsters, Inc.]

Tags: ai-ethics, midjourney, generative-ai, training-data, ai, law

Breaking down ‘EchoLeak’, the First Zero-Click AI Vulnerability Enabling Data Exfiltration from Microsoft 365 Copilot

Aim Labs reported CVE-2025-32711 against Microsoft 365 Copilot back in January, and the fix is now rolled out. This is an extended variant of the prompt inje...

Malleable software

New, delightful manifesto from Ink & Switch. In this essay, we envision malleable software: tools that users can reshape with minimal friction to suit their unique needs. Modification becomes routine, not exceptional. Adaptation happens at the point o...

Quoting datarama

Since Jevons' original observation about coal-fired steam engines is a bit hard to relate to, my favourite modernized example for people who aren't software nerds is display technology. Old CRT screens were horribly inefficient - they were large, clunky and absolutely guzzl...

Quoting Ted Sanders

[on the cheaper o3] Not quantized. Weights are the same.

If we did change the model, we'd release it as a new model with a new name in the API (e.g., o3-turbo-2025-06-10). It would be very annoying to API customers if we ever silently changed models, so we never do this [1].

[1] chatgpt-4o-latest being an explicit exception

Ted Sanders, Research Manager, OpenAI

Tags: generative-ai, openai, o3, ai, llms

Quoting Sam Altman

(People are often curious about how much energy a ChatGPT query uses; the average query uses about 0.34 watt-hours, about what an oven would use in a little over one second, or a high-efficiency lightbulb would use in a couple of minutes. It also uses about 0.000085 gallons of water; roughly one fifteenth of a teaspoon.)

Sam Altman, The Gentle Singularity
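
The comparisons in that quote can be sanity-checked with a few unit conversions. This is only a rough sketch: the two per-query figures come from the quote, while the reading of "a couple of minutes" as 120 seconds and of the oven comparison as exactly one second are assumptions made here for illustration.

```python
# Figures taken from the quote above.
WH_PER_QUERY = 0.34            # watt-hours per average query
GALLONS_PER_QUERY = 0.000085   # US gallons of water per average query

# Standard conversion factors.
JOULES_PER_WH = 3600
TEASPOONS_PER_US_GALLON = 768  # 128 fluid ounces x 6 teaspoons

joules = WH_PER_QUERY * JOULES_PER_WH

# Assumption: "a couple of minutes" means 120 seconds.
implied_bulb_watts = joules / 120

# Assumption: the oven runs for exactly one second; "a little over
# one second" would imply a slightly lower wattage than this.
implied_oven_watts = joules / 1

teaspoons = GALLONS_PER_QUERY * TEASPOONS_PER_US_GALLON

print(f"Energy per query: {joules:.0f} J")
print(f"Implied lightbulb power: {implied_bulb_watts:.1f} W")
print(f"Implied oven power: {implied_oven_watts:.0f} W")
print(f"Water per query: {teaspoons:.4f} tsp (about 1/{1 / teaspoons:.0f} of a teaspoon)")
```

Under those assumptions the lightbulb and teaspoon comparisons line up with the quoted numbers, and the implied oven power sits on the low side for a domestic oven.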

Tags: sam-altman, generative-ai, ai-energy-usage, openai, chatgpt, ai, llms

AI-assisted coding for teams that can't get away with vibes

This excellent piece by Atharva Raykar offers a bunch of astute observations on AI-assisted development that I haven't seen written down elsewhere. Building with AI is fast. The gains in velocity are important, bec...

o3-pro

OpenAI released o3-pro today, which they describe as a "version of o3 with more compute for better responses". It's only available via the newer Responses API. I've added it to my llm-openai-plugin plugin which uses that new API, so you can try it out like this: llm i...

o3 price drop

OpenAI just dropped the price of their o3 model by 80% - from $10/million input tokens and $40/million output tokens to just $2/million and $8/million for the very same model. This is in advance of the release of o3-pro which apparently is coming later today.

This is a pretty huge shake-up in LLM pricing. o3 is now priced the same as GPT 4.1, and slightly less than GPT-4o ($2.50/$10). It’s also less than Anthropic’s Claude Sonnet 4 ($3/$15) and Opus 4 ($15/$75) and sits in between Google’s Gemini 2.5 Pro for >200,000 tokens ($2.50/$15) and 2.5 Pro for <200,000 ($1.25/$10).
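
To make those per-million-token prices concrete, here is a minimal sketch that costs out a single prompt at each rate. The prices are the ones listed above; the `prompt_cost` and `percent_drop` helpers and the 10,000 input / 2,000 output token workload are hypothetical, chosen only for illustration.

```python
# Per-million-token prices in USD as (input, output), taken from the text above.
PRICES = {
    "o3 (old)": (10.00, 40.00),
    "o3 (new)": (2.00, 8.00),
    "GPT-4o": (2.50, 10.00),
    "Claude Sonnet 4": (3.00, 15.00),
    "Claude Opus 4": (15.00, 75.00),
}


def prompt_cost(model, input_tokens, output_tokens):
    """Return the USD cost of one prompt at the listed per-million rates."""
    input_price, output_price = PRICES[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def percent_drop(old, new):
    """Return the percentage reduction from old to new."""
    return (old - new) / old * 100


# Hypothetical workload: 10,000 input tokens and 2,000 output tokens.
for model in PRICES:
    print(f"{model}: ${prompt_cost(model, 10_000, 2_000):.4f}")

old_input, old_output = PRICES["o3 (old)"]
new_input, new_output = PRICES["o3 (new)"]
print(f"Input price drop: {percent_drop(old_input, new_input):.0f}%")
print(f"Output price drop: {percent_drop(old_output, new_output):.0f}%")
```

Because input and output prices fell by the same proportion, the 80% saving holds for any mix of input and output tokens.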

I’ve updated my llm-prices.com pricing calculator with the new rate.

How have they dropped the price so much? OpenAI's Adam Groth credits ongoing optimization work:

thanks to the engineers optimizing inferencing.

Tags: generative-ai, openai, o3, llm-pricing, ai, llms

Magistral — the first reasoning model by Mistral AI

Mistral's first reasoning model is out today, in two sizes. There's a 24B Apache 2 licensed open-weights model called Magistral Small (actually Magistral-Small-2506), and a larger API-only model called Magistral Medium. Mag...

Quoting David Crawshaw

The process of learning and experimenting with LLM-derived technology has been an exercise in humility. In general I love learning new things when the art of programming changes […] But LLMs, and more specifically Agents, affect the process of writing programs in a new and confusing way. Absolutely every fundamental assumption about how I work has to be questioned, and it ripples through all the experience I have accumulated. There are days when it feels like I would be better off if I did not know anything about programming and started from scratch. And it is still changing.

David Crawshaw, How I program with Agents

Tags: coding-agents, ai-assisted-programming, generative-ai, ai-agents, ai, llms

OpenAI hits $10 billion in annual recurring revenue fueled by ChatGPT growth

Noteworthy because OpenAI revenue is a useful indicator of the direction of the generative AI industry in general, and frequently comes up in conversations about the sustainability of the current bubble.

OpenAI has hit $10 billion in annual recurring revenue less than three years after launching its popular ChatGPT chatbot.

The figure includes sales from the company’s consumer products, ChatGPT business products and its application programming interface, or API. It excludes licensing revenue from Microsoft and large one-time deals, according to an OpenAI spokesperson.

For all of last year, OpenAI was around $5.5 billion in ARR. [...]

So these new numbers represent nearly double the ARR figures for last year.
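
As a quick check on "nearly double", the two ARR figures quoted above work out as follows. This is just the arithmetic on the quoted numbers, not additional data.

```python
# ARR figures in billions of USD, as quoted in the text above.
arr_last_year = 5.5
arr_now = 10.0

growth_multiple = arr_now / arr_last_year
growth_percent = (arr_now - arr_last_year) / arr_last_year * 100

print(f"{growth_multiple:.2f}x last year's ARR, up {growth_percent:.0f}%")
```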

Tags: openai, llms, ai, generative-ai

WWDC: Apple supercharges its tools and technologies for developers

Here's the Apple press release for today's WWDC announcements. Two things that stood out to me:

Foundation Models Framework

With the Foundation Models framework, developers will be able to build on Apple Int...

Qwen3 Embedding

New family of embedding models from Qwen, in three sizes: 0.6B, 4B, 8B - and two categories: Text Embedding and Text Reranking. The full collection can be browsed on Hugging Face. The smallest available model is the 0.6B Q8 one, which is available as a 639MB ...

Comma v0.1 1T and 2T - 7B LLMs trained on openly licensed text

It's been a long time coming, but we finally have some promising LLMs to try out which are trained entirely on openly licensed text! EleutherAI released the Pile four and a half years ago: "an 800GB dataset of diverse text for language modeling". It's been used as the basis ...

Quoting Lila Shapiro

For [Natasha] Lyonne, the draw of AI isn’t speed or scale — it’s independence. “I’m not trying to run a tech company,” she told me. “It’s more that I’m a filmmaker who doesn’t want the tech people deciding the future of the medium.” She imagines a future in which indie film...

The last six months in LLMs, illustrated by pelicans on bicycles

I presented an invited keynote at the AI Engineer World's Fair in San Francisco this week. This is my third time speaking at the event - here are my talks from October 2023 and June 2024. My topic this time was "The last six months in LLMs" - originally planned as the last yea...

gemini-2.5-pro-preview-06-05: Try the latest Gemini 2.5 Pro before general availability

Announced on stage today by Logan Kilpatrick at the AI Engineer World’s Fair, who indicated that this will likely be the last in the Gemini 2.5 Pro series. The previous model ID was gemin...