Cloudflare CEO 在机器人流量激增的问题上对你撒了谎
Cloudflare CEO Is Lying to You About the Bot Traffic Jump

原始链接: https://www.flyingpenguin.com/cloudflare-ceo-is-lying-to-you-about-the-bot-traffic-jump/

文中指出,Cloudflare 首席执行官马修·普林斯(Matthew Prince)声称机器人流量已超过人类流量,这是一种歪曲公司自身数据的“魔术”骗局。 作者认为,普林斯通过选择性地引用“仅限 HTML”的流量统计数据来制造虚假叙事,却无视了他自己仪表盘上显示的“全部”流量数据——后者证实约三分之二的互联网流量仍来自人类。此外,该评论反驳了普林斯将“代理型”人工智能机器人视为流量增长主要动力的说法。作者指出,“代理型”流量在统计学上微不足道,而人工智能相关流量的实际增长,是由用于训练大语言模型的大规模抓取机器人所驱动的。 最终,文章认为这一叙事是一种经过精心计算的销售策略,旨在将其“付费抓取”服务商业化。通过将大规模抓取工具与代理型工具混为一谈并歪曲整体数据,这位首席执行官被指控编造了一种危言耸听的趋势,以谋取商业利益。

近期 Hacker News 上的一场讨论对 Cloudflare 首席执行官的言论提出了质疑,该言论称互联网历史上机器人流量首次超过了人类流量。 批评者认为这一说法具有误导性,指出其仅通过专注于 HTML 请求来筛选数据。当分析“所有流量”(包括图像、CSS 和其他资源)时,人类流量依然显著高于机器人流量。评论者指出,虽然机器人流量确实非常庞大,但对“代理流量”(agentic traffic)的定义很复杂,且该首席执行官的言论可能更多是出于营销炒作,而非对网络数据的全面审视。 除了这一具体主张外,这场辩论还凸显了人们对 Cloudflare 作为互联网无处不在的“中间人”角色的深层担忧。许多参与者表达了对该公司在流量方面拥有巨大中心化控制权的不安,并警告称这种垄断在数据监控、政府访问以及产生不可靠的“黑箱”统计数据方面构成了重大风险。尽管一些支持者认为 Cloudflare 只是满足了市场对 DDoS 防护和效率的需求,但批评者的共识是,该公司对现代网络生态系统的影响力是危险的,需要接受更严格的审查。
相关文章

原文

A magic trick is a trick. It misrepresents reality. What the Cloudflare CEO published as bot traffic increasing, is a trick. He misrepresents reality.

The explanation is simple.

The claim is false as stated:

…bots passed human traffic online for the first time in the Internet’s history.

The Cloudflare data shows online traffic is still about two thirds human, not the higher amount being claimed. The CEO ignored the all-traffic number, on his own dashboard, and instead published the HTML-only number as a fact about the whole internet.

That is a lie about what the data shows, and the “All” selector on his own page proves it.

The category Prince points to as the cause contradicts him. Agentic is tiny. What actually fills the AI bucket is training scrapers, like GPTBot and ClaudeBot, pulling text to build models, which have been climbing steadily and predate his announcement. He blamed a friendly, fast-growing sliver of agents fetching pages for people and swapped in unfriendly bulk (mass scraping for training). Why? We can guess, but that is exactly the traffic his pay-to-crawl product exists to bill.

It’s a sales pitch.

And it’s based on a lie.

The actual data shows search crawlers are the largest bot category by a factor of two, the AI number is padded by counting Googlebot twice, the AI traffic that does exist is mostly training scrapers, and the agentic category Prince points to as the cause is the smallest bucket in his own company’s classification. His “agentic” increase press release is disproven by his dataset.

联系我们 contact @ memedata.com