TechnologyFeatured4 min readlogoRead on WIRED

Anthropic and White House Divided Over Claude Fable 5's Safety

Anthropic leaders recently met with White House officials to discuss security concerns over the company's advanced AI model, Claude Fable 5. Despite high-level talks, a significant disagreement remains regarding the actual risk of jailbreaking the model's guardrails. The administration has imposed export controls, while Anthropic argues the concerns are overblown. This article explores the core issues, the key players involved, and what the standoff means for the future of AI regulation and safety.

The ongoing disagreement between artificial intelligence company Anthropic and the White House over the safety of its Claude Fable 5 model took center stage during a recent round of high-level meetings in Washington, DC. Despite talks that included officials from the Commerce Department and National Security Agency, no resolution was reached regarding the export controls imposed on the advanced AI system. The core of the dispute lies in differing assessments of how easily the model's built-in safety measures can be bypassed, a question that has significant implications for both national security and the future of AI governance.

White House building in Washington, DC
The White House in Washington, DC, where the high-level meetings took place.

The Meeting and the Stalemate

On Monday, Anthropic leaders, including cofounder and chief compute officer Tom Brown and head of external affairs Sarah Heck, met with White House and Commerce Department officials to discuss the government's concerns. The administration maintains that Claude Fable 5, a version of Anthropic's powerful Mythos model with additional cybersecurity guardrails, is vulnerable to jailbreaking. This would effectively give users access to Mythos's full capabilities, which the government considers a national security risk. Anthropic representatives, however, reiterated their position that the administration's concerns are overblown.

The meetings included Commerce Secretary Howard Lutnick, who joined via conference call from the G7 summit in Evian, France, though National Cyber Director Sean Cairncross did not participate. The White House has declined to comment publicly, while an Anthropic spokesperson stated, "Both parties are working quickly to get this resolved." Despite this, no immediate pathway for lifting the export controls was established, with any resolution likely contingent on Anthropic fully addressing the jailbreak concerns to the government's satisfaction.

The Core Disagreement Over Risk

At the heart of the standoff is a fundamental disagreement over the severity of the jailbreaking vulnerability. Amazon CEO Andy Jassy first alerted the Trump administration to the potential issues by calling Treasury Secretary Scott Bessent. This prompted the NSA to review the vulnerabilities, which concluded that stripping away Fable 5's guardrails was indeed possible. Anthropic subsequently cut off access to the model for all users over the weekend, leading to a series of emergency calls with Commerce Secretary Lutnick.

In a blog post, Anthropic implied the government's characterization of the risk is overblown. This sentiment was echoed by a group of cybersecurity researchers who sent an open letter to officials, arguing that the export control action was unjustified. They noted that while Anthropic's Mythos-class models are effective at finding flaws, they are not uniquely capable, and restricting access harms defenders more than it helps security. Experts like Katie Moussouris, CEO of Luta Security, have emphasized that guardrails are generally considered "speed bumps" for adversaries and should not be treated as absolute security boundaries.

Amazon office building in Seattle
Amazon's headquarters, as the company's CEO played a key role in alerting the government.

Implications for Anthropic and the AI Industry

The dispute has created uncertainty for Anthropic's corporate future, with investors working over the weekend to assess the impact. Some investors believe the company is being singled out, suggesting a competitor might not have faced the same reaction for releasing a model with similar capabilities. The situation also sets a precedent for other AI labs planning to release advanced models, as it is now expected that they will give the White House early access and keep the government informed on launches.

Cohere CEO Aidan Gomez stated that the events are "informative for everyone that the [US] government would be willing to take these steps. No one can be naive to that reality." As both sides continue their discussions, the outcome of this standoff will likely influence the broader regulatory landscape for artificial intelligence, determining how and when powerful new models can be deployed for public use.

Office building of the US Department of Commerce
The Commerce Department, which is leading the review alongside the NSA.

Conclusion

The ongoing clash between Anthropic and the White House over Claude Fable 5's safety illustrates the complex and often contentious nature of AI governance. While both sides appear committed to reaching a resolution, the gap between their assessments of risk remains wide. How this situation resolves will not only affect Anthropic's immediate future but also set important precedents for how the United States government balances innovation with national security in the rapidly evolving field of artificial intelligence. The industry and investors alike will be watching closely for the next steps.

Enjoyed reading?Share with your circle

Similar articles

1
2
3
4
5
6
7
8