Simon Willison's Weblog
- Author
- Simon Willison
- Public lists
-
Featured
- Fetched
From Async/Await to Virtual Threads
Re-label the "Save" button to be "Publish", to better indicate to users the outcomes of their action
Faster inference
Deep Think in the Gemini app
July newsletter for sponors is out
This morning I sent out the third edition of my LLM digest newsletter for my $10/month and higher sponsors on GitHub. It included the following section headers:
- Claude Code
- Model releases in July
- Gold medal performances in the IMO
- Reverse engineering system prompts
- Tools I'm using at the moment
The newsletter is a condensed summary of highlights from the past month of my blog. I published 98 posts in July - the concept for the newsletter is that you can pay me for the version that only takes 10 minutes to read!
Here are the newsletters I sent out for June 2025 and May 2025, if you want a taste of what you'll be getting as a sponsor. New sponsors instantly get access to the archive of previous newsletters, including the one I sent this morning.
Tags: newsletter
Quoting Logan Kilpatrick
Gemini Deep Think, our SOTA model with parallel thinking that won the IMO Gold Medal 🥇, is now available in the Gemini App for Ultra subscribers!! [...]
Quick correction: this is a variation of our IMO gold model that is faster and more optimized for daily use! We are also giving the IMO gold full model to a set of mathematicians to test the value of the full capabilities.
— Logan Kilpatrick, announcing Gemini Deep Think
Tags: gemini, logan-kilpatrick, llm-reasoning, ai, llms, llm-release, google, generative-ai
Reverse engineering some updates to Claude
More model releases on 31st July
Quoting Christina Wodtke
Trying out Qwen3 Coder Flash using LM Studio and Open WebUI and LLM
Ollama's new app
The one missing feature to date has been an interface: Ollama has been exclusively command-line, which is fine for the CLI literate among us and not much use for everyone else.
They've finally fixed that! The new app's interface is accessible from the existing system tray menu and lets you chat with any of your installed models. Vision models can accept images through the new interface as well.
Via Hacker News
Tags: ai, generative-ai, local-llms, llms, ollama
Quoting Steve Krouse
When you vibe code, you are incurring tech debt as fast as the LLM can spit it out. Which is why vibe coding is perfect for prototypes and throwaway projects: It's only legacy code if you have to maintain it! [...]
The worst possible situation is to have a non-programmer vibe code a large project that they intend to maintain. This would be the equivalent of giving a credit card to a child without first explaining the concept of debt. [...]
If you don't understand the code, your only recourse is to ask AI to fix it for you, which is like paying off credit card debt with another credit card.
— Steve Krouse, Vibe code is legacy code
Tags: vibe-coding, ai-assisted-programming, generative-ai, steve-krouse, ai, llms
The best available open weight LLMs now come from China
Qwen3-30B-A3B-Thinking-2507
OpenAI: Introducing study mode
Qwen/Qwen3-30B-A3B-Instruct-2507
Quoting Nilay Patel
Our plan is to build direct traffic to our site. and newsletters just one kind of direct traffic in the end. I don’t intend to ever rely on someone else’s distribution ever again ;
— Nilay Patel, on The Verge's new newsletter strategy
Tags: nilay-patel, journalism, email
My 2.5 year old laptop can write Space Invaders in JavaScript now
Quoting Anthropic
We’re rolling out new weekly rate limits for Claude Pro and Max in late August. We estimate they’ll apply to less than 5% of subscribers based on current usage. [...]
Some of the biggest Claude Code fans are running it continuously in the background, 24/7.
These uses are remarkable and we want to enable them. But a few outlying cases are very costly to support. For example, one user consumed tens of thousands in model usage on a $200 plan.
— Anthropic
Tags: anthropic, claude-code, llm-pricing, generative-ai, ai, llms
GLM-4.5: Reasoning, Coding, and Agentic Abililties
The many, many, many JavaScript runtimes of the last decade
The many, many, many JavaScript runtimes of the last decade
Extraordinary piece of writing by Jamie Birch who spent over a year putting together this comprehensive reference to JavaScript runtimes. It covers everything from Node.js, Deno, Electron, AWS Lambda, Cloudflare Workers and Bun all the way to much smaller projects idea like dukluv and txiki.js.Via Hacker News
Tags: javascript, nodejs, deno
TIL: Exception.add_note
.add_note(message: str) method to the BaseException class, which means you can add one or more extra notes to any Python exception and they'll be displayed in the stacktrace!
Here's PEP 678 – Enriching Exceptions with Notes by Zac Hatfield-Dodds proposing the new feature back in 2021.
Via Lobste.rs
Enough AI copilots! We need AI HUDs
Enough AI copilots! We need AI HUDs
Geoffrey Litt compares Copilots - AI assistants that you engage in dialog with and work with you to complete a task - with HUDs, Head-Up Displays, which enhance your working environment in less intrusive ways.He uses spellcheck as an obvious example, providing underlines for incorrectly spelt words, and then suggests his AI-implemented custom debugging UI as a more ambitious implementation of that pattern.
Plenty of people have expressed interest in LLM-backed interfaces that go beyond chat or editor autocomplete. I think HUDs offer a really interesting way to frame one approach to that design challenge.
Tags: design, design-patterns, ai, generative-ai, llms, geoffrey-litt
Official statement from Tea on their data leak
Qwen3-235B-A22B-Thinking-2507
Using GitHub Spark to reverse engineer GitHub Spark
Quoting Recurse Center
I Drank Every Cocktail
Adam's write up is delightful, incorporating pedantry, data nerdery, a trip to the Internet Archive, some excellent bar recommendations in New York and London and hints at elicit rum smuggling to help make the final cocktail, the IBA Tiki, using two different Havana Club rums that are illegal in the USA thanks to import restrictions.
Via Andy Baio
Tags: cocktails
Instagram Reel: Veo 3 paid preview
Instagram Reel: Veo 3 paid preview
@googlefordevs on Instagram published this reel featuring Christina Warren with prompting tips for the new Veo 3 paid preview (mp4 copy here).
(Christine checked first if I minded them using that concept. I did not!)
Tags: google, ai, generative-ai, gemini, pelican-riding-a-bicycle, text-to-video