What if the browser built the UI for you?

原始链接: https://jonno.nz/posts/what-if-your-browser-built-the-ui-for-you/

## The Future of Frontend Development: Browser-Driven UI

Frontend development is at an inflection point. Despite advances in AI UI generation, most SaaS products still rely on custom-built interfaces, leading to duplicated effort and inconsistent user experiences. One potential solution? Move UI generation into the browser itself.

Current approaches such as server-driven UI and generative UI still keep control in the hands of the service provider. The author proposes a radical shift: services publish a "manifest" describing their capabilities and data in detail, and the browser generates the UI according to *user* preferences (font size, colour scheme, accessibility needs), applied universally across all services.

This "adaptive browser" concept addresses key problems: inconsistent UI patterns across integrations and uneven accessibility. Accessibility becomes the default rather than an afterthought. Complexity shifts to API design and semantic data contracts, but this encourages richer, more standardised APIs — effectively making the API the core product.

This future, possibly three to five years away, envisions the browser as an intelligent user agent that stitches together contextual data from services via these APIs. Success hinges not on pretty UIs but on strong APIs and useful data, with organisations setting customisation limits within defined boundaries. The frontend isn't dying, but its development is evolving toward API design and browser intelligence.

## Browser-Built User Interfaces: Discussion Summary

A recent Hacker News discussion explored the idea of browsers using LLMs to generate user interfaces dynamically, shifting control from site developers to users. The core concept is a web where sites expose data through APIs and the browser builds a personalised interface from user preferences — essentially turning websites into data sources rather than fixed experiences.

Some saw potential gains in accessibility, customisation, and efficiency, but many pushed back. Concerns centred on branding — companies want control over their image — and on the role of deliberately crafted design in conveying value beyond raw functionality. Others highlighted the practical challenges of support, documentation, and consistency.

The debate touches the tension between a simplified, data-driven web and the human desire for aesthetic experience and brand identity. Fully replacing traditional web design seems unlikely, but the idea sparks discussion about the future of web interaction: how SaaS products might be built and used, and how users might interact with data through AI agents. Ultimately, feasibility depends on cost, complexity, and whether users prioritise personalisation over established brand experiences.

## Original Article

We're at a genuinely weird inflection point in frontend development. AI can generate entire interfaces now. LLMs can reason about data and layout. And yet — most SaaS products still ship hand-crafted React apps, each building its own UI, its own accessibility layer, its own theme system, its own responsive breakpoints. Not every service, but the vast majority.

That's a lot of duplicated effort for what's essentially the same job — showing a human some data and letting them do stuff with it.

I've been thinking about this a lot lately, and I built a proof of concept to test an idea: what if the browser itself generated the UI?

Where we are right now

The industry is circling this idea from multiple angles, but nobody's quite landed on it yet.

Server-driven UI has been around for a while — Airbnb and others pioneered it for mobile, where app store review cycles make shipping UI changes painful. The server sends down a JSON tree describing what to render, and the client just follows instructions. It's clever, but the server is still calling the shots.

Google recently shipped Natively Adaptive Interfaces — a framework that uses AI agents to make accessibility a default rather than an afterthought. Really cool idea, and the right instinct. But it's still operating within a single app's boundaries. Your accessibility preferences don't carry between Google's products and, say, your project management tool.

Then there's the generative UI wave — CopilotKit, Vercel's AI SDK, and others building frameworks where LLMs generate components on the fly. These are powerful developer tools, but they're still developer tools. The generation happens at build time or on the server. The service is still in control.

See the pattern? Every approach keeps the power on the service side.

Flip it

Here's the idea behind the adaptive browser: what if the generation happened on your side?

Instead of a service shipping you a finished frontend, it publishes a manifest — a structured description of what it can do. Its capabilities, endpoints, data shapes, what actions are available. Think of it like an API spec, but semantic. Not just "here's a GET endpoint" but "here's a list of repositories, they're sortable by stars and language, you can create, delete, star, or fork them."

Your browser takes that manifest, calls the actual APIs, gets real data back, and then generates the UI based on your preferences. Your font size. Your colour scheme. Your preferred layout (tables vs cards vs kanban). Your accessibility needs. All applied universally, across every service.

The manifest for something like GitHub looks roughly like this — a service describes its capabilities and the browser figures out the rest:

service:
  name: "GitHub"
  domain: "api.github.com"

capabilities:
  - id: "repositories"
    endpoints:
      - path: "/user/repos"
        semantic: "list"
        entity: "repository"
        sortable_fields: [name, updated_at, stargazers_count]
        actions: [create, delete, star, fork]

The browser takes that, fetches the data, and generates a bespoke interface — using an LLM to reason about the best way to present it given who you are and what you're trying to do.
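To make that loop concrete, here is a minimal sketch of the browser-side logic: read a capability from the manifest, reconcile it with the user's preferences, and produce a plan that a renderer (or an LLM prompt) could turn into actual UI. All field names, preference keys, and the `plan_ui` function are illustrative assumptions, not part of any real spec.

```python
# Hypothetical browser-side planning step. The manifest mirrors the YAML
# example above; the preference keys are invented for illustration.
MANIFEST = {
    "service": {"name": "GitHub", "domain": "api.github.com"},
    "capabilities": [{
        "id": "repositories",
        "endpoints": [{
            "path": "/user/repos",
            "semantic": "list",
            "entity": "repository",
            "sortable_fields": ["name", "updated_at", "stargazers_count"],
            "actions": ["create", "delete", "star", "fork"],
        }],
    }],
}

USER_PREFS = {"layout": "cards", "sort_by": "stargazers_count", "font_scale": 1.25}

def plan_ui(manifest, prefs):
    """Combine one capability with user preferences into a render plan."""
    endpoint = manifest["capabilities"][0]["endpoints"][0]
    # Honour the user's sort preference only if the service declares it sortable.
    sort_by = (prefs["sort_by"]
               if prefs["sort_by"] in endpoint["sortable_fields"]
               else endpoint["sortable_fields"][0])
    return {
        # A "list" semantic lets the user pick tables vs cards vs kanban.
        "component": prefs["layout"] if endpoint["semantic"] == "list" else "detail",
        "entity": endpoint["entity"],
        "sort_by": sort_by,
        "actions": endpoint["actions"],
        "font_scale": prefs["font_scale"],
    }

plan = plan_ui(MANIFEST, USER_PREFS)
```

The point of the sketch is the division of labour: the service declares what is possible, and every presentational decision comes from the user's side of the fence.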

Why this matters more than it sounds

When I was building the app store and integrations platforms at Xero, one of the constant headaches was that every third-party integration had its own UI patterns. Users had to learn a new interface for every app they connected. If the browser was generating the UI from a shared set of preferences, that problem just… goes away.

Accessibility is the big one though. Right now, accessibility is a feature that gets bolted on — and often badly. When the browser generates the UI, accessibility isn't a feature. It's the default. Your preferences — high contrast, keyboard-first navigation, screen reader optimisation, larger text — apply everywhere. Not because every developer remembered to implement them, but because they're baked into how the UI gets generated in the first place.

Customisation becomes genuinely personal too. Not "pick from three themes the developer made" but "this is how I interact with software, full stop."

The trade-off is real though

Frontend complexity drops dramatically, but the complexity doesn't disappear — it moves behind the API. And honestly, it probably increases.

API design becomes way more important. You can't just throw together some REST endpoints and call it a day. Your manifest needs to be semantic — describing what the data means, not just what shape it is. Data contracts between services matter more. Versioning matters more.
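One way that extra rigour might show up in practice is tooling that rejects manifests describing only shape, not meaning. This is a hypothetical "semantic lint" — the required keys are taken from the example manifest earlier, and the checker itself is an assumption about how such tooling could work:

```python
# Sketch: validate that every endpoint in a manifest carries semantic
# metadata (role, entity, actions), not just a path. Field names follow
# the illustrative manifest format used in this post.
REQUIRED_SEMANTIC_KEYS = ("semantic", "entity", "actions")

def lint_manifest(manifest):
    """Return a list of human-readable problems; empty means the manifest
    is semantically complete enough for a browser agent to reason about."""
    problems = []
    for cap in manifest.get("capabilities", []):
        for ep in cap.get("endpoints", []):
            for key in REQUIRED_SEMANTIC_KEYS:
                if key not in ep:
                    problems.append(f"{cap['id']}{ep['path']}: missing '{key}'")
    return problems

issues = lint_manifest({"capabilities": [{
    "id": "repositories",
    "endpoints": [{"path": "/user/repos", "semantic": "list",
                   "entity": "repository"}],  # no 'actions' declared
}]})
```

A manifest that fails a check like this isn't wrong in the REST sense — the endpoint might work fine — but a browser agent has no way to know what it can safely offer the user.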

graph LR
    A[Service] -->|Publishes manifest + APIs| B[Browser Agent]
    C[User Preferences] --> B
    D[Org Guardrails] --> B
    B -->|Generates| E[Bespoke UI]

But here's the thing — this trade-off pushes us somewhere genuinely interesting. If every service needs to describe itself semantically through APIs and manifests, those APIs become the actual product surface. Not the frontend. The APIs.

And once APIs are the product surface, sharing context between platforms becomes the interesting problem. Your project management tool knows what you're working on. Your email client knows who you're talking to. Your code editor knows what you're building. Right now, none of these talk to each other in any meaningful way because they're all locked behind their own UIs. In a manifest-driven world, that context flows through the APIs — and your browser can stitch it all together into something coherent.

Where this is headed (IMHO)

I reckon we're about 3-5 years from this being mainstream. The pieces are all there — LLMs that can reason about UI, standardisation efforts around sending UI intent over APIs, and a growing expectation from users that software should adapt to them, not the other way around.

The services that win in this world won't be the ones with the prettiest hand-crafted UI. They'll be the ones with the best APIs, the richest manifests, and the most useful data. The frontend becomes a generated output, not a hand-crafted input.

Organisations will set preference guardrails — "our people can use dark or light mode, must have destructive action confirmations, these fields are always visible" — while individuals customise within those bounds. Your browser becomes your agent, not just a renderer.
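The guardrail idea can be sketched as a simple merge: the organisation declares allowed values and hard requirements, and individual preferences are honoured only within those bounds. The shapes and key names here are assumptions for illustration:

```python
# Hypothetical org policy: allowed values clamp user choices, and
# org-mandated settings always override whatever the user picked.
ORG_GUARDRAILS = {
    "allowed": {"theme": ["dark", "light"]},
    "forced": {"confirm_destructive_actions": True},
}

def resolve_prefs(user_prefs, guardrails):
    """Merge user preferences with org guardrails; org policy wins."""
    resolved = dict(user_prefs)
    # Clamp restricted preferences to an allowed value.
    for key, allowed in guardrails["allowed"].items():
        if resolved.get(key) not in allowed:
            resolved[key] = allowed[0]
    # Hard requirements can never be switched off by the user.
    resolved.update(guardrails["forced"])
    return resolved

prefs = resolve_prefs(
    {"theme": "hotdog", "confirm_destructive_actions": False},
    ORG_GUARDRAILS,
)
# The unsupported theme falls back to an allowed one, and destructive
# action confirmations stay on regardless of the user's choice.
```

The same resolved-preference object would then feed the UI generation step, so the guardrails apply uniformly across every service the browser renders.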

I built the adaptive browser as a proof of concept to test this thinking — it uses Claude to generate UIs from a GitHub manifest and user preferences defined in YAML. It's rough, but the direction feels right.

The frontend isn't dying. But what we think of as "frontend development" is about to change. The interesting work moves to API design, semantic data contracts, and building browsers smart enough to be genuine user agents.
