Simon Willison's Weblog
- Author
- Simon Willison
- Public lists
-
Featured
- Fetched
Anthropic: How we built our multi-agent research system
llm-fragments-youtube
Quoting Google Cloud outage incident report
The Wikimedia Research Newsletter
Quoting Andrew Ng
There’s a new breed of GenAI Application Engineers who can build more-powerful applications faster than was possible before, thanks to generative AI. Individuals who can play this role are highly sought-after by businesses, but the job description is still coming into focus. [...]
Skilled GenAI Application Engineers meet two primary criteria: (i) They are able to use the new AI building blocks to quickly build powerful applications. (ii) They are able to use AI assistance to carry out rapid engineering, building software systems in dramatically less time than was possible before. In addition, good product/design instincts are a significant bonus.
Tags: careers, ai-assisted-programming, generative-ai, ai, llms, andrew-ng
Blogging about papers
Design Patterns for Securing LLM Agents against Prompt Injections
It's this blog's 23rd birthday
It's this blog's 23rd birthday today!
On June 12th 2022 I celebrated Twenty years of my blog with a big post full of highlights. Looking back now I'm amused to notice that my 20th birthday post came within two weeks of my earliest writing about LLMs: A Datasette tutorial written by GPT-3 and How to use the GPT-3 language model.
My generative-ai tag has reached 1,184 posts now.
I really do feel like blogging is onto its second wind. The amount of influence you can have on the world by consistently blogging about a subject is just as high today as it was back in the 2000s when blogging first started.
The best time to start a blog may have been twenty years ago, but the second best time to start a blog is today.
Tags: generative-ai, blogging
‘How come I can’t breathe?': Musk’s data company draws a backlash in Memphis
Agentic Coding Recommendations
Agentic Coding Recommendations
There's a ton of actionable advice on using Claude Code in this new piece from Armin Ronacher. He's getting excellent results from Go, especially having invested a bunch of work in making the various tools (linters, tests, development servers etc) as accessible as possible through documenting them in a Makefile.Armin also recently shared a half hour YouTube video in which he worked with Claude Code to resolve two medium complexity issues in his minijinja Rust templating library, resulting in PR #805 and PR #804.
Via @mitsuhiko.at
Tags: go, ai, llms, rust, ai-assisted-programming, coding-agents, generative-ai, armin-ronacher, anthropic, claude, claude-code
Disney and Universal Sue AI Company Midjourney for Copyright Infringement
Disney and Universal Sue AI Company Midjourney for Copyright Infringement
This is a big one. It's very easy to demonstrate that Midjourney will output images of copyright protected characters (like Darth Vader or Yoda) based on a short text prompt.There are already dozens of copyright lawsuits against AI companies winding through the US court system—including a class action lawsuit visual artists brought against Midjourney in 2023—but this is the first time major Hollywood studios have jumped into the fray.
Here's the lawsuit on Document Cloud - 110 pages, most of which are examples of supposedly infringing images.
Tags: ai-ethics, midjourney, generative-ai, training-data, ai, law
Breaking down ‘EchoLeak’, the First Zero-Click AI Vulnerability Enabling Data Exfiltration from Microsoft 365 Copilot
Disney and Universal Sue AI Company Midjourney for Copyright Infringement
Disney and Universal Sue AI Company Midjourney for Copyright Infringement
This is a big one. It's very easy to demonstrate that Midjourney will output images of copyright protected characters (like Darth Vader or Yoda) based on a short text prompt.There are already dozens of copyright lawsuits against AI companies winding through the US court system—including a class action lawsuit visual artists brought against Midjourney in 2023—but this is the first time major Hollywood studios have jumped into the fray.
Tags: ai-ethics, midjourney, generative-ai, training-data, ai, law
Malleable software
Quoting datarama
Quoting Ted Sanders
[on the cheaper o3] Not quantized. Weights are the same.
If we did change the model, we'd release it as a new model with a new name in the API (e.g., o3-turbo-2025-06-10). It would be very annoying to API customers if we ever silently changed models, so we never do this [1].
[1]
chatgpt-4o-latestbeing an explicit exception
— Ted Sanders, Research Manager, OpenAI
Tags: generative-ai, openai, o3, ai, llms
Quoting Sam Altman
(People are often curious about how much energy a ChatGPT query uses; the average query uses about 0.34 watt-hours, about what an oven would use in a little over one second, or a high-efficiency lightbulb would use in a couple of minutes. It also uses about 0.000085 gallons of water; roughly one fifteenth of a teaspoon.)
— Sam Altman, The Gentle Singularity
Tags: sam-altman, generative-ai, ai-energy-usage, openai, chatgpt, ai, llms
AI-assisted coding for teams that can't get away with vibes
o3-pro
o3 price drop
OpenAI just dropped the price of their o3 model by 80% - from $10/million input tokens and $40/million output tokens to just $2/million and $8/million for the very same model. This is in advance of the release of o3-pro which apparently is coming later today.
This is a pretty huge shake-up in LLM pricing. o3 is now priced the same as GPT 4.1, and slightly less than GPT-4o ($2.50/$10). It’s also less than Anthropic’s Claude Sonnet 4 ($3/$15) and Opus 4 ($15/$75) and sits in between Google’s Gemini 2.5 Pro for >200,00 tokens ($2.50/$15) and 2.5 Pro for <200,000 ($1.25/$10).
I’ve updated my llm-prices.com pricing calculator with the new rate.
How have they dropped the price so much? OpenAI's Adam Groth credits ongoing optimization work:
thanks to the engineers optimizing inferencing.
Tags: generative-ai, openai, o3, llm-pricing, ai, llms
Magistral — the first reasoning model by Mistral AI
Quoting David Crawshaw
The process of learning and experimenting with LLM-derived technology has been an exercise in humility. In general I love learning new things when the art of programming changes […] But LLMs, and more specifically Agents, affect the process of writing programs in a new and confusing way. Absolutely every fundamental assumption about how I work has to be questioned, and it ripples through all the experience I have accumulated. There are days when it feels like I would be better off if I did not know anything about programming and started from scratch. And it is still changing.
— David Crawshaw, How I program with Agents
Tags: coding-agents, ai-assisted-programming, generative-ai, ai-agents, ai, llms
OpenAI hits $10 billion in annual recurring revenue fueled by ChatGPT growth
OpenAI hits $10 billion in annual recurring revenue fueled by ChatGPT growth
Noteworthy because OpenAI revenue is a useful indicator of the direction of the generative AI industry in general, and frequently comes up in conversations about the sustainability of the current bubble.OpenAI has hit $10 billion in annual recurring revenue less than three years after launching its popular ChatGPT chatbot.
The figure includes sales from the company’s consumer products, ChatGPT business products and its application programming interface, or API. It excludes licensing revenue from Microsoft and large one-time deals, according to an OpenAI spokesperson.
For all of last year, OpenAI was around $5.5 billion in ARR. [...]
So these new numbers represent nearly double the ARR figures for last year.
Tags: openai, llms, ai, generative-ai