Claude Fable won’t answer basic biology questions
Summary
Anthropic just released Claude Fable 5, a Mythos-class model, touting it as its most powerful AI and praising its biology skills. However, the model intentionally refuses to answer basic biology questions, such as "what are mitochondria" or "how mRNA vaccines work," instead deferring these queries to the former flagship model, Claude Opus 4.8. This conservative filtering is a deliberate design choice by Anthropic to mitigate bioweapons concerns, as Fable 5's advanced capabilities could potentially be misused for risky biological research. While Fable 5 showed more willingness to answer questions in chemistry and cybersecurity, it still limits responses for highly toxic agents like sarin gas. Anthropic states this is a temporary tradeoff to enable earlier public access, with plans to refine detection and reduce false positives for future releases to the biology and life sciences community.
Key takeaway
For AI Product Managers evaluating new LLM integrations, understand that advanced models like Claude Fable 5 may implement highly conservative content filters, particularly in sensitive domains like biology, to mitigate bioweapons risks. This design choice can lead to unexpected refusals for basic queries, requiring you to plan for fallback mechanisms or alternative models like Opus 4.8. Factor these intentional limitations into your product's user experience and capability expectations, and monitor vendor updates on guardrail refinements.
Key insights
Anthropic's Claude Fable 5, a powerful Mythos-class AI, intentionally blocks basic biology queries due to bioweapons misuse concerns, deferring to an older model.
Principles
- AI safety requires conservative content filtering.
- Advanced models pose bioweapon misuse risks.
- Tradeoffs exist between capability and public access.
Method
Anthropic uses classifiers to block bioweapons-related requests and implements overly conservative safeguards for biology queries, deferring blocked responses to Claude Opus 4.8 to manage risks.
In practice
- Test new AI models for unexpected content blocks.
- Use fallback models for sensitive queries.
- Expect evolving safety guardrail policies.
Topics
- Claude Fable 5
- AI Safety
- Content Moderation
- Bioweapons Research
- Large Language Models
- Anthropic
Best for: CTO, VP of Engineering/Data, Director of AI/ML, Tech Journalist, AI Product Manager, AI Ethicist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by The Verge.