Anthropic's Project Glasswing - restricting Claude Mythos to security researchers - sounds necessary to me

· Source: Simon Willison's Weblog · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Cybersecurity & Data Privacy, Software Development & Engineering · Depth: Expert, long

Summary

Anthropic has launched Project Glasswing, a restricted preview program for its new Claude Mythos Preview model, which possesses advanced cybersecurity research capabilities. Unlike Claude Opus 4.6, which had a near-0% success rate, Mythos Preview successfully developed working exploits 181 times in a benchmark against Mozilla's Firefox 147 JavaScript engine. Anthropic claims the model has found thousands of high-severity vulnerabilities, including some in every major operating system and web browser, and can chain together multiple vulnerabilities to create sophisticated exploits. Project Glasswing provides $100M in usage credits and $4M in direct donations to open-source security organizations, partnering with entities like AWS, Apple, Microsoft, Google, and the Linux Foundation to proactively address these risks. The model will not be generally available, as Anthropic aims to develop safeguards before wider deployment of "Mythos-class" models.

Key takeaway

For CTOs and VPs of Engineering evaluating their organization's cybersecurity posture, the emergence of models like Claude Mythos Preview signals an urgent need to reassess existing vulnerability management strategies. Your teams should prioritize integrating AI-powered vulnerability discovery tools and allocate significant resources to patching efforts, as the rate of newly identified, complex exploits will accelerate dramatically. Consider participating in industry initiatives like Project Glasswing to gain early access to defensive capabilities and contribute to collective security.

Key insights

Advanced LLMs are now autonomously finding and exploiting critical software vulnerabilities at an unprecedented scale.

Principles

Method

Anthropic's Claude Mythos Preview model autonomously identifies and exploits vulnerabilities, including local privilege escalation, remote code execution, and complex multi-vulnerability chains, across major operating systems and web browsers.

In practice

Topics

Code references

Best for: CTO, VP of Engineering/Data, Executive, AI Security Engineer, Software Engineer, Director of AI/ML

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Simon Willison's Weblog.