Claude Opus 4.6 brings one million token context window to Anthropic's flagship model
Summary
Anthropic has released Claude Opus 4.6, its new flagship large language model, featuring a one-million-token context window now available in beta. This model significantly improves retrieval accuracy, scoring 76 percent on the MRCR v2 test with one million tokens, compared to its predecessor Sonnet 4.5's 18.5 percent. Opus 4.6 also leads in knowledge work benchmarks, achieving 1,606 Elo points on GDPval-AA, surpassing OpenAI's GPT-5.2 by 144 points. New API features include "Adaptive Thinking" and "Compaction" for managing context, alongside increased maximum output to 128,000 tokens. Despite performance gains, the model exhibits a slight increase in vulnerability to indirect prompt injection attacks.
Key takeaway
For CTOs and VPs of Engineering evaluating large language models for complex document processing or knowledge work, Claude Opus 4.6 presents a compelling option due to its one-million-token context window and superior benchmark performance in retrieval and knowledge tasks. However, you should carefully assess its increased vulnerability to indirect prompt injection, especially for agentic AI applications, and implement robust security measures.
Key insights
Claude Opus 4.6 offers a one-million-token context window with enhanced retrieval and knowledge work performance.
Principles
- Larger context windows can degrade performance without mitigation.
- Benchmarking provides a rough indicator of real-world performance.
Method
Anthropic addresses "context rot" with model improvements and a "Compaction" feature that summarizes older context automatically.
In practice
- Use "Compaction" to manage large conversation histories.
- Adjust the "effort parameter" for simpler tasks to reduce costs.
- Explore "Agent Teams" for parallel AI agent task execution.
Topics
- Claude Opus 4.6
- Large Context Windows
- LLM Benchmarks
- Prompt Injection
- API Integrations
Best for: CTO, VP of Engineering/Data, Director of AI/ML, AI Engineer, Machine Learning Engineer, Data Scientist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by The Decoder.