云端大语言模型淘金热即将结束
Cloud-based LLM gold rush is ending

原始链接: https://automato.substack.com/p/apple-wwdc-and-the-fable-5-embargo

作者认为,人工智能行业正趋于瓶颈。由于大语言模型(LLM)本质上是概率性的,若要替代确定性的业务流程,将面临无法维持的开发与监管成本。 苹果公司向本地端侧AI处理的转型,反映出其务实地背离了昂贵且依赖云端的大模型模式。苹果并未参与追求通用人工智能(AGI)的竞赛,而是专注于以用户为中心的实用工具,这凸显了“前沿”大模型基准测试在现实世界中的价值可能不及宣传水平。作者认为,大模型的商业模式正承受压力:模型成本不断攀升,但其实际且可持续的应用场景依然有限。 此外,作者警告不要将人工智能定义为“国家安全”问题,认为这种论调助长了军备竞赛心态,进而加剧全球冲突与割裂。归根结底,作者将大语言模型视为一种“增强工具”,认为其最佳用途是辅助而非替代人类判断。本文对早期采用者提出了警示:在层出不穷的新功能发布背后,行业可能正在转型,因为当前大模型架构的局限性已愈发不可忽视。

抱歉。
相关文章

原文

Saturday, 13th, 2026

I wrote this article yesterday. I decided to let it mature before publishing. This is what I woke up to this morning:

This is exactly what I mean when writing about the weaponization of AI and its use as a national security concern. I did not change a word of the article.

Screenshot from Anthropic’s website.

Friday, 12th, 2026.

I think it is. Not the AI one, though.

Apple announced a few days ago at WWDC something that goes way beyond Siri improvements; it is a signal of what the AI world looks like today, and where it is going.

Let me explain.

Apple believes that for most uses, we don’t need cloud-based LLMs. They have decided that Mac OS should be an AI-enabled system that processes workflows and tasks locally.

Cloud systems can be used when needed. It makes sense, users will not need to buy a monthly subscription if their Mac has the power to run AI automations and tasks natively.

What does that mean for you and me? Probably most of our automations and Claude skills will eventually run on our Macs. We will likely have to rebuild our apps.

So what happens to LLMs? They are already hinting at where they intend to go: advanced AI work: agents, harnesses, deep reasoning tasks. Specialist work, not default infrastructure.

LLMs are genuinely useful, but our hopes and imagination may have been playing a role in our misinterpretation of what they are really good at. LLMs have a limitation by design: they are probabilistic in nature.

Probabilistic systems interpret context; they do not execute with certainty. Asking an LLM to scan invoices and always update a database correctly is more hope than a solution. That use case pushes a probabilistic system to behave as a deterministic one. So why not use a deterministic system instead? Many experienced early adopters will disagree, but I would love to know how many credits their agentic systems cost per transaction, how much maintenance time they require, and how long they took to build. And then I would ask:

Why not use the LLM to build the deterministic tool instead?

A well-architected system can manage the probabilistic nature of an LLM through validation layers, confidence scoring, and human review queues. That is true. But those layers have a cost — in development time, in maintenance, in the human oversight required to catch what the model gets wrong. That cost rarely appears in the business case. It surfaces later, in the people doing the work that the automation was supposed to eliminate.

Leave a comment

That brings me to what LLMs genuinely do well:

  • Democratizing software development — removing the technical barrier while the human still directs

  • Accelerating learning — removing the access barrier while the human still synthesizes

  • Interpretation aid — reducing cognitive load while the human still decides

  • Language and translation work — removing friction while the human still owns the meaning

Notice the pattern. In every case, the human remains essential. LLMs are amplification tools: they speed up and increase in size what we want to do, and also our mistakes.

The consumer marketing around AGI has gone quiet. The labs themselves are more AGI-focused than ever, as it is in their stated missions and research agendas. But the public narrative has shifted toward practical features and monthly subscriptions.

Apple’s decision to focus on local, practical AI for its users rather than chasing frontier model benchmarks is telling. It suggests that the race toward artificial general intelligence may be less central to real-world value than the industry was claiming.

This makes me suspect that OpenAI, Google, and Anthropic are working on very different models behind closed doors, experiments we do not know about yet, because the current LLM approach has a ceiling, and they know it.

I find it especially disturbing when I hear people framing AI as a national security issue. That framing does not produce wisdom; it produces escalation.

Every major technology of the last century that got absorbed into a power and dominance narrative ended up generating conflict rather than progress.

Other nations and blocs will not watch passively. They will build alternatives, restrict access, and retaliate through regulation and competing investment.

The arms-race framing fragments the technology rather than developing it. Apple, interestingly, offers a partial counterexample:

I am not saying AI is over. I am saying LLMs are hitting a wall. The LLM business model is under pressure, not because the technology has stopped improving, but because the cost of accessing that improvement keeps rising while the sustainable use cases for most businesses and independent professionals remain narrower than advertised.

I do not want to be distracted by the barrage of new functionality announcements; they are good manipulation tactics. The truth reveals itself more clearly when you look at the subscription models and the escalating credit prices.

I am an AI early adopter myself, and I write Automato precisely because I want to connect with people interested in this space, sharing value and mutually benefiting from the opportunities this innovation might bring.

But my decades in the industry hint at the fact that our enthusiasm often hides the underlayers of what is really happening. So this article is also a warning to myself:

What is your take?

Leave a comment

I wish you a very good day!

Jose from Automato

联系我们 contact @ memedata.com