How might a browser be developed?
There’s a confluence of things that have happened recently that have made me question how browsers might be developed:
- On January 8th 2026, Simon Willison predicted that someone will build a browser using mainly AI-assisted tools within 3 years
- On January 27th 2026, one human and one agent built a browser from scratch (great experiments)
- On February 19th 2026, news came out that Taalas had a chip that can spit out nearly 17,000 tokens per second by baking a model directly into the silicon.
I would like to indulge in a little bit of projection and science fiction of my own.
Firstly, I think it is very cool that people are building browsers with LLMs, and as you might expect from a blog about the intersection of the web and AI, I’m on the positive side of this technology.
It turns out that a comprehensive spec and a heap of unit tests are a good way to keep the LLM in check and produce outcomes that should work.
While the Web Platform Tests project is huge (over 72,000 commits and 2,224,302 tests) and a testament to the shared goals of browser vendors, it’s still nowhere near comprehensive enough.
From my own usage of LLMs and the anecdotes of many other developers, LLMs are very good at creating unit tests, and unit tests are good guardrails for coding agents. I expect to see Spec => Unit Test happening more and more with modern tooling. I also think it will make the specification process a lot more robust, and every browser’s implementation will get better as we strive for spec conformance.
If we can get this far with an LLM building a new browser today, and we have comprehensive specs and an even more robust test suite, it feels like there is a pretty straight line to a dramatic change in how all browsers are built:
- A standards org could create a “canonical reference browser”. It’s not meant for production but to find all the edge cases in the spec and the test suite.
- Browser vendors will monitor all test failures in their browser and fix them, or update the browser when a spec changes.
- Browser vendors will increasingly use LLMs to implement features in the browser, and with a well-planned spec and a comprehensive test suite, any feature can be developed this way.
This puts a clear emphasis on defining good specs, and justifies more investment there.
Browser engines will then differentiate consciously on whether the specs match their vision of the web, rather than on their own engineering capability or budgets.
OK, so we are building browsers with an LLM now, but I haven’t addressed point 3. While I mentioned Taalas, the point is about the progression of capability and speed of LLMs. And this is where I project a bit into the land of science fiction (or maybe just pure fantasy).
Let’s make some base assumptions:
- Model quality is good enough to implement features given a good enough spec and tests
- Model quality keeps improving following scaling laws
- Model performance keeps improving on two fronts. Hardware FLOPS roughly doubles every 2.3 years across ML accelerators, and algorithmic efficiency halves the compute needed to reach a given level of performance roughly every 8 months.
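To get a feel for how fast those last two assumptions compound, here is a back-of-envelope sketch. The doubling periods are the assumptions stated above, not measurements, and the function is purely illustrative:

```typescript
// Effective speedup from two compounding trends (the post's assumptions):
// - hardware FLOPS doubling every 2.3 years
// - algorithmic efficiency doubling every 8 months (8/12 of a year)
function effectiveSpeedup(years: number): number {
  const hardware = Math.pow(2, years / 2.3);         // FLOPS doubling
  const algorithmic = Math.pow(2, years / (8 / 12)); // efficiency doubling
  return hardware * algorithmic;
}

console.log(effectiveSpeedup(5).toFixed(0)); // prints 817
```

If both trends hold, a fixed level of capability becomes roughly 800x cheaper to run over five years, which is the kind of trajectory the rest of this post leans on.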
In Whither CMS I show that you can build a server middleware that takes any markup and renders it on the fly. `<carousel></carousel>` becomes a working component. If the browser supports the carousel-related CSS primitives the middleware uses them; if not, it implements the feature in JS. It’s not practical today, but I think we will get to optimized, generated UIs quickly given the current performance trajectories.
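The shape of that middleware can be sketched in a few lines. This is a toy: the “generator” here is a hard-coded lookup table, where in the Whither CMS vision an LLM would produce the expansion (and a JS fallback when the CSS primitives are missing). The `carousel` expansion below is a hypothetical stand-in:

```typescript
// Toy sketch of markup-expanding middleware: unknown elements are rewritten
// into working markup at request time. In the real idea the generator is an
// LLM; here it is a hard-coded lookup for illustration.
type Generator = (inner: string) => string;

const generators: Record<string, Generator> = {
  // Hypothetical expansion; real generated output would be much richer.
  carousel: (inner) =>
    `<div class="carousel" style="overflow-x: auto; scroll-snap-type: x mandatory;">${inner}</div>`,
};

function expandCustomTags(html: string): string {
  // Match attribute-free paired tags like <carousel>...</carousel> and
  // expand the ones we have a generator for; leave everything else alone.
  return html.replace(
    /<([a-z-]+)>([\s\S]*?)<\/\1>/g,
    (match, tag: string, inner: string) =>
      tag in generators ? generators[tag](inner) : match,
  );
}

console.log(expandCustomTags('<carousel><img src="a.jpg"></carousel>'));
```

A production version would need a real HTML parser rather than a regex, but the request-time “intent in, implementation out” flow is the point.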
I think we web developers get a bit of a bum rap. Everyone is saying the tools will automate page generation. However, the web is the people who have the ideas and make the content… So I want to throw this one idea out to our browser-engineering friends: there is a point in the future where a web specification could be implemented in real time, in the browser…
My dog (Cwtch, she’s a good girl) and I were discussing this very topic whilst on a long walk the other week, trying to determine what would change if the browser is built around the page, instead of the page around the capabilities of the browser. What if the browser produces working components from the provided markup (heck, even just a description) at request time, and what would the implications be for the web and browsers?
Take a website that says “measure my heart rate from my Coros monitor and graph it”. Today, that needs WebBluetooth, the browser vendor’s prior decision to ship the API, the right Bluetooth profile, and whatever JS the developer writes on top. In an instantly-generated browser, the runtime already knows the device. The Bluetooth Heart Rate Service over GATT is documented hardware. The page describes the intent. The browser builds the binding on the fly, the sandbox enforces the boundary, and the user gets a working app.
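The “documented hardware” part is concrete: the GATT Heart Rate Measurement characteristic (0x2A37) defines a flags byte whose lowest bit says whether the reading is an 8-bit or 16-bit little-endian value. That decoding step, the kind of binding a generator would have to produce, looks like this (the sample `DataView` is faked here; in a browser it would arrive via a Web Bluetooth characteristic notification):

```typescript
// Decode a GATT Heart Rate Measurement value (characteristic 0x2A37).
// Byte 0 is a flags field; bit 0 selects the heart-rate value format:
// 0 = uint8 at offset 1, 1 = uint16 (little-endian) at offset 1.
function parseHeartRate(value: DataView): number {
  const flags = value.getUint8(0);
  const is16Bit = (flags & 0x01) !== 0;
  return is16Bit ? value.getUint16(1, /* littleEndian */ true) : value.getUint8(1);
}

// Faked notification payload for illustration: flags = 0x00, rate = 72 bpm.
const sample = new DataView(new Uint8Array([0x00, 72]).buffer);
console.log(parseHeartRate(sample)); // prints 72
```

Everything a generator needs to write this, field layout, endianness, units, is already in the Bluetooth specification; the web-specific wrapper is the part that stops being hand-written.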
Which means the role of a web spec changes. Today specs define a standardised interface that abstracts complexity, so people can access functionality consistently across browsers, securely. Bluetooth is already specced out as a hardware platform. LLMs are getting increasingly good at interfacing with hardware, either by reverse-engineering it or by reading the vendor’s guide. People are already generating against microcontrollers and getting working drivers out the other end. If the browser can generate the binding from the underlying hardware spec, the web’s redundant equivalent (WebBluetooth, and maybe dozens of others) stops being necessary. The web-platform then can be pared back to a minimal core, and everything above that is generated on demand from intent.
I’m picking on Web Bluetooth because it’s an easy example, but there’s an entire class of capabilities that machines have, that parts of the web ecosystem want, and that teams don’t have the time or budget to implement. I think at some point those interfaces could be built dynamically.
Rendering is another area that could change a lot. A lot of the web’s UI development falls into two categories:
- developer-experience improvements that make things already possible easier to build.
- enabling things that have never been possible (see cross-page view-transitions).
In an instantly-generated browser, is the UI just WASM, Canvas, and WebGPU exposed to accessibility tools? I honestly don’t know, but Flipbook.page recently showed some compelling ideas about generating UI in a world of LLMs without HTML, and the A2UI protocol is another area that explores declarative JSON descriptions of UI that any client can render natively.
We’d still need to work out security and privacy too. The same-origin model and CSP would have to remain, alongside a lot of new primitives we don’t have yet (or don’t yet know we need).
In a world of instant generation, what is the browser vendors’ role in deciding what happens? Do we want them to decide on capabilities?
- If we do, then web standards for high-level features still make sense, and it’s likely we go to the maximal extreme: the browser ships every feature (generated).
- If we don’t, then a likely path takes the browser to the minimal extreme: web standards become the absolute minimum needed for a secure and private runtime, and the runtime solves the rest.
Maybe there’s a third path, where browser vendors provide “taste” (I really dislike the use of this term).
A few second-order things happen if the second path wins. Polyfills disappear, because the generator handles backward compatibility. The “vendor gate” on shipping a new feature collapses, because if the runtime supports an intent description, every browser running a recent generator ships it. Browser engineering teams shift toward runtime, sandbox, and verification work, because one of the most labour-intensive parts of shipping a browser was always the per-feature implementation. And the security story flips: provenance becomes a real problem, because two users on the same URL might get different generated implementations and bug reports stop being reproducible without the generated artefact attached.
All of that is the engineering side. The user side is harder and more complex to reason about. I think there is a genuinely good version of this: Accessibility could become the default rather than something every site author has to remember to bolt on, because the browser generates the most accessible implementation for the user it’s actually serving. Internationalisation could come for free. Low-power devices could get a lean tailored version of any site rather than the same bloated one. People who can’t write code today could publish a site by writing prose. The web could become more accommodating, not less.
But the web’s promise was also that the same URL produced roughly the same experience for anyone with a browser. A flagship iPhone and a five-year-old Android were on the same web. In a generative world, your experience of any given page becomes a function of your device, your generator, and where they’re running. So people in regions where high-end hardware is unaffordable, or where the on-device model isn’t the best one, get a worse web by default. Not just slower, but less capable, less accurate, more error-prone. Sharing a URL with a friend stops meaning sharing an experience.
Urgh.
I’m personally more interested in exploring the minimal browser and the generated future. In this world the runtime can evolve faster than a standards process (as long as you solve the sandbox, security, and privacy expectations), and it’s completely unexplored as a topic, so we would need to understand all of the issues and opportunities.
This was a really fun thought experiment for me that I wanted to share. We are many years out from having local hardware quick enough to build a browser at runtime. I’m very interested to see how browsers, and not just web development, change as LLMs evolve, and I think there is some low-hanging fruit when it comes to feature development today.
I believe with pretty high confidence that the actual near-term future is browser vendors all starting to use tools to create features from comprehensive, well-tested specs, with more thought and rigour going into that process.
I do think that we (the industry) need to spend a lot more time than we currently do thinking about what happens to the browser as LLMs progress, and we need to do it soon, because I believe we’re already on the path to “autonomous user-agents” with the new generation of super-apps that centralise and control content delivery, and they might be the tools people prefer in the future.
