An update on recent Claude Code quality reports
Summary
Anthropic identified and resolved three distinct issues behind recent reports of degraded code quality in Claude Code, the Claude Agent SDK, and Claude Cowork, affecting Sonnet 4.6, Opus 4.6, and Opus 4.7. First, on March 4, Claude Code's default reasoning effort was changed from `high` to `medium` to reduce latency; this was reverted on April 7 because users preferred higher intelligence. Second, on March 26, a bug in a caching optimization caused Claude to continuously clear older thinking from idle sessions, which led to forgetfulness and repetition; this was fixed on April 10. Third, on April 16, a system prompt instruction meant to reduce verbosity inadvertently harmed coding quality; it was reverted on April 20. All three issues were resolved by April 20 (v2.1.116), and Anthropic is resetting usage limits for all subscribers as of April 23, 2026.
Key takeaway
For engineering leaders overseeing AI product development, this incident underscores the need for robust change management and testing. Teams should roll out changes gradually, have internal staff test the same public builds that users receive, and give automated code review broader context. Prioritize user feedback channels, which can surface subtle degradations that internal metrics miss, especially when trading off latency against intelligence. Together, these practices reduce the risk of unexpected quality regressions.
Key insights
Three distinct changes caused Claude Code's perceived degradation, highlighting the complexity of LLM product management.
Principles
- Prioritize intelligence over latency for complex tasks.
- Thoroughly test prompt changes across broad eval suites.
- User feedback is critical for identifying subtle degradations.
Method
Anthropic traced degradation reports to specific changes, performed ablations on system prompts, and used an improved Code Review tool to identify and fix bugs.
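A prompt ablation of the kind mentioned above can be sketched as a leave-one-out loop: drop each instruction in turn, re-score the prompt, and rank instructions by how much the score changes without them. This is a minimal illustration, not Anthropic's tooling; `ablate_prompt`, `toy_evaluate`, the example instructions, and the scores are all hypothetical stand-ins for a real eval suite.

```python
from typing import Callable, List, Sequence, Tuple


def ablate_prompt(
    instructions: Sequence[str],
    evaluate: Callable[[str], float],
) -> List[Tuple[str, float]]:
    """Leave-one-out ablation over system prompt instructions.

    Returns (instruction, delta) pairs sorted by delta, largest first.
    A positive delta means the score went UP when the instruction was
    removed, i.e. the instruction was hurting quality.
    """
    lines = list(instructions)
    baseline = evaluate("\n".join(lines))
    deltas = []
    for i, line in enumerate(lines):
        # Rebuild the prompt with exactly one instruction removed.
        variant = "\n".join(lines[:i] + lines[i + 1:])
        deltas.append((line, evaluate(variant) - baseline))
    deltas.sort(key=lambda pair: pair[1], reverse=True)
    return deltas


def toy_evaluate(prompt: str) -> float:
    """Hypothetical scorer: pretends one brevity instruction costs quality.

    A real evaluate() would run the prompt against a broad eval suite.
    """
    return 0.8 if "Keep every reply very short." in prompt else 0.9
```

Calling `ablate_prompt(["Use the repository's style.", "Keep every reply very short.", "Run tests before finishing."], toy_evaluate)` ranks the brevity instruction first, because removing it is the only change that raises the toy score.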
In practice
- Implement gradual rollouts for significant model changes.
- Dogfood internally on the same public builds that users receive.
- Expand context for automated code review tools.
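One common way to implement the gradual rollout in the first bullet is deterministic percentage bucketing: hash each user into a stable bucket and enable the change only for buckets below the current rollout percentage. The sketch below is illustrative only; the function names and the feature key are assumptions, not anything described in the original article.

```python
import hashlib


def rollout_bucket(user_id: str, feature: str) -> int:
    """Map a (feature, user) pair to a stable bucket in [0, 10000)."""
    digest = hashlib.sha256(f"{feature}:{user_id}".encode("utf-8")).digest()
    # Use the first 8 bytes as an integer, then reduce to basis points.
    return int.from_bytes(digest[:8], "big") % 10_000


def is_enabled(user_id: str, feature: str, percent: float) -> bool:
    """Return True if this user falls inside the current rollout percentage."""
    if not 0 <= percent <= 100:
        raise ValueError("percent must be between 0 and 100")
    # Buckets are in basis points, so 25% enables buckets 0..2499.
    return rollout_bucket(user_id, feature) < round(percent * 100)
```

Because the bucket depends only on the feature key and the user ID, raising the percentage from 10 to 50 keeps every already-enabled user enabled, and dropping it to 0 acts as an immediate rollback.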
Topics
- Claude Code
- Model Degradation
- Reasoning Effort
- Caching Optimization
- System Prompt Changes
Best for: CTO, VP of Engineering/Data, Director of AI/ML, AI Engineer, Machine Learning Engineer, MLOps Engineer
Editorial summary, takeaway, and curation by AIssential. Original article published by Anthropic Engineering Blog.