The US government may be asking Anthropic the impossible by demanding unhackable LLMs

· Source: The Decoder · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Cybersecurity & Data Privacy · Depth: Intermediate, quick

Summary

The US government is in conflict with Anthropic, accusing the AI company of disregarding Trump's cyber executive order by releasing its Fable 5 model without explicit approval from a designated clearinghouse. Government officials, reportedly tipped by Amazon and other tech companies, claim Anthropic knew about a potential "jailbreak" in Fable 5. This accusation highlights the government's perceived lack of understanding of AI security, as experts like OpenAI have stated that prompt injection, a related hacking method, may never be fully solved, implying no LLM is truly "unhackable." Meanwhile, over 100 security experts and tech executives, including Alex Stamos and Rachel Tobac, have published an open letter defending Anthropic. They argue that Fable 5, along with models like GPT-5.5, Opus, Sonnet, and Chinese Kimi 2.7, are valuable tools for finding security flaws and that export controls would strip defenders of essential resources, especially as Chinese open-weight models are rapidly catching up. Anthropic CEO Dario Amodei previously warned in 2023 that jailbreaks could be "life or death."

Key takeaway

For policymakers considering AI export controls, you must recognize that demanding "unhackable" frontier AI models is currently an impossible standard. Your focus should shift from preventing all vulnerabilities to rapidly developing and deploying countermeasures, as even leading AI companies acknowledge inherent security risks. Restricting access to advanced models like Fable 5 could inadvertently disadvantage your own cybersecurity efforts, especially given the rapid progress of international competitors.

Key insights

The US government's demand for "unhackable" LLMs reveals a fundamental misunderstanding of current AI security limitations.

Principles

In practice

Topics

Best for: CTO, VP of Engineering/Data, Executive, Policy Maker, AI Security Engineer, Director of AI/ML

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by The Decoder.