云端大语言模型淘金热即将结束

云端大语言模型淘金热即将结束
Cloud-based LLM gold rush is ending

原始链接: https://automato.substack.com/p/apple-wwdc-and-the-fable-5-embargo

作者认为，人工智能行业正趋于瓶颈。由于大语言模型（LLM）本质上是概率性的，若要替代确定性的业务流程，将面临无法维持的开发与监管成本。苹果公司向本地端侧AI处理的转型，反映出其务实地背离了昂贵且依赖云端的大模型模式。苹果并未参与追求通用人工智能（AGI）的竞赛，而是专注于以用户为中心的实用工具，这凸显了“前沿”大模型基准测试在现实世界中的价值可能不及宣传水平。作者认为，大模型的商业模式正承受压力：模型成本不断攀升，但其实际且可持续的应用场景依然有限。此外，作者警告不要将人工智能定义为“国家安全”问题，认为这种论调助长了军备竞赛心态，进而加剧全球冲突与割裂。归根结底，作者将大语言模型视为一种“增强工具”，认为其最佳用途是辅助而非替代人类判断。本文对早期采用者提出了警示：在层出不穷的新功能发布背后，行业可能正在转型，因为当前大模型架构的局限性已愈发不可忽视。

抱歉。

原文

Saturday, 13th, 2026

I wrote this article yesterday. I decided to let it mature before publishing. This is what I woke up to this morning:

This is exactly what I mean when writing about the weaponization of AI and its use as a national security concern. I did not change a word of the article.

Screenshot from Anthropic’s website.

Friday, 12th, 2026.

I think it is. Not the AI one, though.

Apple announced a few days ago at WWDC something that goes way beyond Siri improvements; it is a signal of what the AI world looks like today, and where it is going.

Let me explain.

Apple believes that for most uses, we don’t need cloud-based LLMs. They have decided that Mac OS should be an AI-enabled system that processes workflows and tasks locally.

Cloud systems can be used when needed. It makes sense, users will not need to buy a monthly subscription if their Mac has the power to run AI automations and tasks natively.

What does that mean for you and me? Probably most of our automations and Claude skills will eventually run on our Macs. We will likely have to rebuild our apps.

So what happens to LLMs? They are already hinting at where they intend to go: advanced AI work: agents, harnesses, deep reasoning tasks. Specialist work, not default infrastructure.

LLMs are genuinely useful, but our hopes and imagination may have been playing a role in our misinterpretation of what they are really good at. LLMs have a limitation by design: they are probabilistic in nature.

Probabilistic systems interpret context; they do not execute with certainty. Asking an LLM to scan invoices and always update a database correctly is more hope than a solution. That use case pushes a probabilistic system to behave as a deterministic one. So why not use a deterministic system instead? Many experienced early adopters will disagree, but I would love to know how many credits their agentic systems cost per transaction, how much maintenance time they require, and how long they took to build. And then I would ask:

Why not use the LLM to build the deterministic tool instead?

A well-architected system can manage the probabilistic nature of an LLM through validation layers, confidence scoring, and human review queues. That is true. But those layers have a cost — in development time, in maintenance, in the human oversight required to catch what the model gets wrong. That cost rarely appears in the business case. It surfaces later, in the people doing the work that the automation was supposed to eliminate.

云端大语言模型淘金热即将结束 Cloud-based LLM gold rush is ending

云端大语言模型淘金热即将结束
Cloud-based LLM gold rush is ending