Anthropic's Project Glasswing - restricting Claude Mythos to security researchers - sounds necessary to me
Summary
Anthropic has launched Project Glasswing, a restricted preview program for its new Claude Mythos Preview model, which possesses advanced cybersecurity research capabilities. Unlike Claude Opus 4.6, which had a near-0% success rate, Mythos Preview successfully developed working exploits 181 times in a benchmark against Mozilla's Firefox 147 JavaScript engine. Anthropic claims the model has found thousands of high-severity vulnerabilities, including some in every major operating system and web browser, and can chain together multiple vulnerabilities to create sophisticated exploits. Project Glasswing provides $100M in usage credits and $4M in direct donations to open-source security organizations, partnering with entities like AWS, Apple, Microsoft, Google, and the Linux Foundation to proactively address these risks. The model will not be generally available, as Anthropic aims to develop safeguards before wider deployment of "Mythos-class" models.
Key takeaway
For CTOs and VPs of Engineering evaluating their organization's cybersecurity posture, the emergence of models like Claude Mythos Preview signals an urgent need to reassess existing vulnerability management strategies. Your teams should prioritize integrating AI-powered vulnerability discovery tools and allocate significant resources to patching efforts, as the rate of newly identified, complex exploits will accelerate dramatically. Consider participating in industry initiatives like Project Glasswing to gain early access to defensive capabilities and contribute to collective security.
Key insights
Advanced LLMs are now autonomously finding and exploiting critical software vulnerabilities at an unprecedented scale.
Principles
- AI progress in code generation directly enhances cyber capabilities.
- Proactive industry-wide collaboration is essential for AI security.
- LLMs can chain multiple vulnerabilities for sophisticated exploits.
Method
Anthropic's Claude Mythos Preview model autonomously identifies and exploits vulnerabilities, including local privilege escalation, remote code execution, and complex multi-vulnerability chains, across major operating systems and web browsers.
In practice
- Use LLMs for local vulnerability detection.
- Apply AI for black box testing of binaries.
- Conduct AI-driven penetration testing.
Topics
- Claude Mythos
- Project Glasswing
- Vulnerability Research
- Exploit Development
- Open-Source Security
Code references
Best for: CTO, VP of Engineering/Data, Executive, AI Security Engineer, Software Engineer, Director of AI/ML
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Simon Willison's Weblog.