642: Mythos & Fable (Same Brain/Different Chains), Tokens-per-Watt, Nvidia's Financial Q&A, Google + SpaceX, Leopold's $20bn at 24, Heat vs Guns, Mosquito Bachelors, and WarGames
Summary
Anthropic has released Fable 5 and Mythos 5, two new "Mythos-class" large language models that share a core "brain" but differ in their safety safeguards. Fable 5, available to the public, employs "traffic cops" to redirect sensitive queries (e.g., cyber, biology, chemistry) to older models like Opus 4.8 or block them, and subtly hinders competitor development queries. Mythos 5, an "unchained" version, is for trusted access programs like Project Glasswing. Both models demonstrate significant performance gains on benchmarks like ExploitBench (69% to 78%), BioMysteryBench (29.6% to 46.1%), and Hebbia's Finance Benchmark, and excel in vision tasks. Pricing is set at \$10 per million input tokens and \$50 per million output tokens, less than half of Mythos Preview. Nvidia's CEO Jensen Huang is reframing the company as an "Operating System for AI Factories and Agents Company," emphasizing full-stack solutions, the critical role of CPUs for tool execution, and a hybrid local/cloud AI architecture where local devices act as private control layers. The article also highlights a Google-SpaceX deal for 110,000 NVIDIA GPUs worth \$920 million per month, a fund manager's rapid growth to \$20 billion AUM, and a re-evaluation of heat-related deaths in Europe versus gun deaths in the U.S.
Key takeaway
For Directors of AI/ML evaluating infrastructure investments, prioritize performance-per-watt and full-stack architectural efficiency over raw hardware cost. Your teams should consider how Anthropic's Fable 5 and Mythos 5's advanced capabilities and pricing structure could impact your model selection for specific applications, especially for high-volume token users or those requiring robust vision and reasoning. Additionally, assess the strategic implications of hybrid local/cloud AI for agent deployment, ensuring local agents handle immediate, private workflows while leveraging cloud models for complex reasoning.
Key insights
AI model capabilities are rapidly advancing, with new architectures emphasizing full-stack efficiency and hybrid deployment.
Principles
- Performance-per-watt is the key metric for modern AI factories.
- Slow CPUs can strand expensive GPU capacity in AI data centers.
- Hybrid edge-cloud AI enables private, low-latency control loops.
Method
Anthropic's Fable 5 and Mythos 5 use a shared core model with distinct external "traffic cop" classifiers to manage safety and access for different use cases.
In practice
- Evaluate AI infrastructure based on total cost of ownership per token.
- Consider CPU performance to maximize GPU utilization in AI factories.
- Explore hybrid local/cloud AI for secure, low-latency agent workflows.
Topics
- Anthropic Fable 5
- Anthropic Mythos 5
- NVIDIA AI Strategy
- AI Infrastructure
- Hybrid AI
- SpaceX Google Deal
- AI Model Safety
Best for: CTO, VP of Engineering/Data, AI Engineer, Investor, Director of AI/ML, AI Scientist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Liberty’s Highlights.