Anthropic’s safety warnings may have just backfired — the government has pulled the plug on its most powerful AI

· Source: AI News & Artificial Intelligence | TechCrunch · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Cybersecurity & Data Privacy · Depth: Intermediate, short

Summary

The U.S. government ordered Anthropic to immediately disable access to its Claude Fable 5 and Claude Mythos 5 AI models worldwide, citing national security concerns. Anthropic complied but expressed disagreement, arguing the directive, framed as an export control action, stems from a claimed "potential narrow, non-universal jailbreak" of Fable 5. Mythos, Anthropic's most capable model, was previously restricted and shared with 50 vetted organizations like Amazon and Microsoft via Project Glasswing for defensive cybersecurity due to its ability to find software vulnerabilities. Fable 5, released three days prior, was a guarded version of Mythos intended for general public use, benchmarked by Vals AI as the most capable public AI model. Anthropic contends the identified capability is already widespread and used defensively, and that its independent safeguards remain effective.

Key takeaway

For AI Policy Makers evaluating frontier model deployment, this incident underscores the immediate regulatory risks associated with perceived AI dangers, even when models are designed with safeguards. Your policies must balance national security concerns, like potential jailbreaks, with the industry's need for clear, consistent standards to avoid halting innovation. Consider the implications of export controls on global access and the role of independent safety mechanisms.

Key insights

Government intervention in AI model deployment, spurred by developer safety warnings, highlights complex regulatory and commercial tensions.

Principles

Method

Project Glasswing involves sharing highly capable, restricted AI models with vetted organizations for defensive cybersecurity vulnerability identification.

In practice

Topics

Best for: CTO, VP of Engineering/Data, Investor, Policy Maker, AI Security Engineer, Director of AI/ML

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by AI News & Artificial Intelligence | TechCrunch.