Tuesday, August 12, 2025

Benedict's Newsletter: No. 604

NO. 604   FREE EDITION   SUNDAY 10 AUG 2025
SPONSORED BY LINEAR
Meet the system for modern software development

Linear is the purpose-built tool for planning and building products. Roadmaps, PRDs, issues — everything lives in one place.

Request a demo

My work

AI eats the world

Every year, I produce a big presentation exploring macro and strategic trends in the tech industry. New in summer 2025, 'AI eats the world'. LINK

News

OpenAI does Open, and launches ChatGPT5

OpenAI aimed for two big splashes this week, releasing a set of open-source models for the first time since 2019 and launching GPT5. 

OpenAI was supposed to be, well, open, but it stopped making the models public as a matter of responsibility and principle on the claim that this was too 'dangerous'. However, first Meta and then a wave of Chinese companies have released a series of best-in-class open models, and so Sam Altman has been strategising: relevance might be more important than principles. GPT-OSS does nothing very different, but it comes in the standard range of sizes and weights and gets into the top 10 or so on most benchmarks: if you need to run and customise models yourself, Llama is too far behind and you can't use the Chinese models, OpenAI is there for you, presuming of course you trust OpenAI not to change course again. Sam Altman is playing chess again. 

GPT5 is a more complex story: it continues the steady improvement of LLMs and puts OpenAI back at the top of the benchmarks but isn't reallya step change in capability, with one exception. Like most model releases these days, 'GPT5' is actually a family of large and small models, trading off cost and quality versus speed and ability to run on a PC versus a server farm. But that has given us the much-mocked 'model picker', where you're supposed to know if your task needs GPTo3, o4, 4o or Mini Pro Plus. GPT5 is a system that comes with a component that acts as a router, deciding for itself which underlying model to send your task to. See this week's column. GPT-5OPEN SOURCE

Google does video simulation

The third big story this week was Google's release of Genie 3, which can generate realistic 3D worlds in real time as you move through them. AAA games have become enormously expensive to produce in recent years, with huge numbers of people away building all of those elastic environments by hand; it seems pretty clear that AI will automate a lot of that, at a minimum, and it may also lead to entirely new kinds of experiences, much as 3D itself or networking did. LINK

The week in AI

Google has been talking to advertisers about its plans to include ads in 'AI Mode' search results. LINK

Elon Musk also said he plans to include ads in results from xAI's LLM Grok (the one that called itself 'MechaHitler'), building on his success in attracting advertisers back to Twitter. LINK

Cloudflare (CDN) accused Perplexity of not just ignoring robots.txt requests not to crawl websites, but of hiding the identity of its crawlers to read websites that are actively trying to block it. LINK

Eleven Music's latest generator is out, and worth playing with. I'm old enough to remember when synths destroyed creativity. LINK

News from autonomy 

Tesla shut down its 'Dojo' project to build its own supercomputer to analyse driving data (which last year was supposedly worth tens of billions of dollars) - this might be because Elon Musk's xAI has plenty of its own compute available on easy terms to Tesla. LINK

Meanwhile, Amazon's Zoox got clearance to test in public. Zoox has the unusual approach to autonomy of making an entirely new vehicle, which I honestly don't understand - once autonomy works then cars can be redesigned with no steering wheel, sure, but why spend the money on that in advance? LINK

And further out again, Joby (electric helicopters) is buying Blade, which does helicopter shuttles around NYC, planning to use it as a route-to-market. LINK

Apple gives Trump a trinket 

In 'Godfather II', the Cuban representative of ITT gave President Batista a solid gold telephone. In 2025, Apple's Tim Cook gave President Trump a piece of Corning Glass on a gold plinth. LINKCUBA

About
What matters in tech? What's going on, what might it mean, and what will happen next?

I've spent 25 years analysing mobile, media and technology, and worked in equity research, strategy, consulting and venture capital. I'm now an independent analyst, and I speak and consult on strategy and technology for companies around the world.

Ideas

Google published a blog post pushing back on the narrative that its AI Overviews have slashed traffic to web publishers. There's a lot of wiggle-room in the language it uses (referral traffic is 'relatively stable' - what does that mean?), but the more important underlying point is that the move to LLMs will change the distribution patterns. LLMs will change what kinds of searches people do and what kinds of sites they visit, not just shift traffic from publishers to Google. LINK

Yet another attempt to analyse jobs exposed to AI, this one from the EIG. Reading it, though, I'm struck by an interesting implicit assumption that's common to a lot of this kind of work: that the more 'in person' and 'physical' a job is, the harder it is to capture with AI. In particular, it suggests that physical trainers are much harder to automate. But couldn't we have personalised AI physical trainers, at scale, with generative video and live analysis of what you're doing? Maybe that will convert to AI much faster than some 'desk-based' jobs? This reminds me a little of the notorious attempt to analyse the TAM for Uber by calculating the TAM for taxis: disruption changes our definitions of the market. LINK

Shopify has always wanted to move up the stack from commodity SaaS provider to build a network that can make recommendations and drive traffic to merchants. Now AI makes that a lot more complex. LINK

Analysing the effect of the AI capex boom on the broader US economy. LINK

Apollo's first move into data centres as real estate investment. LINK

Meta's paper on how it uses AI to rewrite ads to get better response rates. LINK

The former head of EY UK says AI will be big for new entrants, and takes board seats at new entrants. LINK

It's very unclear what AI means for consultants and accountants, but 'machines that can write code' are clearly a very big deal for the ~$300bn Indian IT outsourcing industry. LINK

A taxonomy of LLM 'hallucinations'. LINK

Why Apple cares about F1. LINK

Amazon's 'ad-stick' retail advertising dongle. LINK

Russia and Ukraine are using huge numbers of cheap battlefield drones but are still mostly in stalemate. How do drones change from a tool in existing force structures and tactics to a new way to win? LINK

Outside interests

I decided to bury the most important story of the week down here - AOL is discontinuing its dial-up internet service. LINK

Data

A survey of AI use by newsrooms. LINK

Sky News got data on UK 'de minimus' imports from China - the loophole just closed by the USA that let Shein and Temu ship directly from China to the consumer without paying import duties. The US figure in 2024 was about £6bn (~$8bn), compared to a bit over $30bn for the USA. LINK

Upgrade to Premium
You're getting the Free edition. Subscribers to the Premium edition got this two days ago on Sunday evening, together with an exclusive column, complete access to the archive of over 600 issues, and more.

Preview from the Premium edition

AI products 

It seems to me that in the last two years, generative AI has developed in two ways. On one hand, the models get 'better' and on the other, the labs companies try to wrap things around them - 'thin GPT wrappers' - to turn them into products. If you're not an AI researcher, GPT5 does nothing very interesting for the first of these - it continues the steady, incremental progress we've seen since 2022. But it represents a pretty useful step towards a product. 

I've written a few times that LLMs are a raw technology that looks like a product. Because the nature of the technology itself is that you can type in natural language questions and get answers, you don't need to be an engineer or need any training to start using them, unlike, say, SQL. You can just ask for what you want! 

But the reason we have thousands of different 'database' products for different use cases instead of just one is not only that SQL is not based on natural language - it's that those use cases work a lot better when you have dedicated UI, tooling, connections, and decisions about how they should work around them - when they become product. We don't have GUIs only because C++ and SQL are hard - we have GUIs because it's hard to work out what the task is and how it should work from scratch at a blank screen. The GUI represents decisions and institutional knowledge about the problem and the task. 

.

THIS IS A PREVIEW FROM THE PREMIUM EDITION - PREMIUM SUBSCRIBERS GET THE COMPLETE COLUMN EVERY WEEK. YOU SHOULD UPGRADE.

 

No comments:

Post a Comment

This Too Shall Pass (It Always Has)

​ ​ It's so easy to idealize the past. As if people haven't always been deranged. As if things haven't always ...