Statement on US government directive to suspend access to Fable 5 and Mythos 5

Original link: https://www.anthropic.com/news/fable-mythos-access

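The US government has ordered Anthropic to immediately suspend access to its Fable 5 and Mythos 5 models, citing national security concerns over a potential “jailbreak” vulnerability. The directive covers all foreign nationals, including Anthropic employees who are foreign nationals, and to comply Anthropic is disabling both models for all users.

Anthropic is complying with the order but strongly disagrees with the government’s reasoning. The company argues that the alleged jailbreak is narrow and non-universal, and that the capability it exposed is available from other publicly available models. Anthropic says its models use a strong “defense in depth” strategy that makes them at least as safe as the industry standard. It maintains that so extreme a response, recalling a widely deployed commercial model on such limited evidence, would effectively halt new frontier AI deployments if applied consistently across the industry.

The company apologized for the service disruption but criticized the lack of transparency in the government’s process. Anthropic is working to resolve what it calls a “misunderstanding” and expects to provide a further update within 24 hours. Access to all other Anthropic models is unaffected.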

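The US government issued an export control directive requiring Anthropic to stop providing access to its “Fable 5” and “Mythos 5” AI models to all foreign nationals, including those residing in the United States. Anthropic says the move stems from concern over a potential “jailbreak” vulnerability, but the company argues that the capability in question is common in other models and poses no unique national security threat.

The statement sparked heated discussion on Hacker News. Many users criticized Anthropic’s earlier rhetoric, arguing that the company’s past “alarmism” about AI dangers fueled a regulatory backlash that now threatens its own business. Critics suggested the intervention might be retaliatory, politically motivated, or an instance of “regulatory capture” that could stifle competition and push international users toward Chinese-developed models. Others questioned the directive’s legal force and practical feasibility, given how hard it is to verify the nationality of every API user. While some expressed frustration at the unpredictability of US tech policy, many participants saw the episode as a performative “marketing” moment and predicted that the restriction would be short-lived or hard to enforce uniformly.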

Original text

The US government, citing national security authorities, has issued an export control directive to suspend all access to Fable 5 and Mythos 5 by any foreign national, whether inside or outside the United States, including foreign national Anthropic employees. The net effect of this order is that we must abruptly disable Fable 5 and Mythos 5 for all our customers to ensure compliance. Access to all other Anthropic models will not be affected.

We received the directive from the government today at 5:21pm (ET). The letter did not provide specific details of its national security concern. Our understanding is that the government believes it has become aware of a method of bypassing, or “jailbreaking” Fable 5. We reviewed a demonstration of this specific technique being used to identify a small number of previously known, minor vulnerabilities. These vulnerabilities all appear relatively simple, and we have found that other publicly-available models are able to discover them as well without requiring a bypass.

Anthropic’s posture with respect to Fable’s safeguards, as laid out in our launch blog post, is the following:

  • We have instituted strong safeguards that greatly reduce the likelihood that Fable is misused for tasks related to cybersecurity (among others). In fact, our safeguards are so strong that many users have complained that they are overly broad.
  • In the weeks leading up to the launch of Fable, Anthropic worked with the US government, the UK AISI, multiple private third-party organizations and internal teams to red-team Fable’s safeguards for thousands of hours in total.
  • These tests showed that Fable’s safeguards are substantially more effective than those of any previously deployed model.
  • No testers have yet been able to find a universal jailbreak—a jailbreak method that can very broadly bypass the model’s safeguards, unblocking a wide range of cyber capabilities.
  • We suspect that perfect jailbreak resistance is not currently possible for any model provider. Every safeguard used in the industry is vulnerable to non-universal jailbreaks (which can elicit some cyber information in specific circumstances), and it is likely that universal jailbreaks will eventually be found in the future. We stated this clearly when we released Fable 5.
  • Given that perfect jailbreak resistance does not appear to be possible today, Anthropic adopted a defense in depth strategy with Fable 5. We aimed to make jailbreaks either narrow (in the case of non-universal jailbreaks) or very expensive to produce (in the case of universal jailbreaks), and to combine this with thorough monitoring to quickly detect and shut down any successful attacks. This is also why Anthropic has required 30-day retention of customer data with Fable—a policy change that carries real costs for us with customers, but that allows us to research and mitigate jailbreaks.
  • We stand by this defense in depth strategy. It reduces the risks posed by Fable, making them comparable to the risks of existing models already deployed across the industry.
  • We have not even received a disclosure of a concerning non-universal potential jailbreak that led to a harmful result. The potential jailbreaks that have been disclosed to us are either entirely benign responses or are minor findings that provide no Mythos-specific uplift.

To date, the government has only given us verbal evidence of a potential narrow, non-universal jailbreak, which essentially consists of asking the model to read a specific codebase and fix any software flaws. Our understanding is that one potential jailbreak was shared with the government. We have reviewed a report that we believe is the basis of the government's directive and validated that the level of capability displayed there is widely available from other models (including OpenAI’s GPT-5.5), and is used every day by the defenders who keep systems safe. We will share more details over the next 24 hours.

We are complying with the government’s legal directive and are removing access to Fable 5 and Mythos 5 for all users. However, we disagree that the finding of a narrow potential jailbreak should be cause for recalling a commercial model deployed to hundreds of millions of people. If this standard was applied across the industry, we believe it would essentially halt all new model deployments for all frontier model providers.

As we have stated publicly, we believe the government should have the ability to block unsafe deployments, as part of a statutory process that is transparent, fair, clear, and grounded in technical facts. This action does not adhere to those principles.

We apologize for this disruption to our customers. We believe this is a misunderstanding and are working to restore access as soon as possible.
