Claude Opus 4.6 brings one million token context window to Anthropic's flagship model

2026-02-05 · Source: The Decoder · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics, Cybersecurity & Data Privacy · Depth: Intermediate, short

Summary

Anthropic has released Claude Opus 4.6, its new flagship large language model, featuring a one-million-token context window now available in beta. This model significantly improves retrieval accuracy, scoring 76 percent on the MRCR v2 test with one million tokens, compared to its predecessor Sonnet 4.5's 18.5 percent. Opus 4.6 also leads in knowledge work benchmarks, achieving 1,606 Elo points on GDPval-AA, surpassing OpenAI's GPT-5.2 by 144 points. New API features include "Adaptive Thinking" and "Compaction" for managing context, alongside increased maximum output to 128,000 tokens. Despite performance gains, the model exhibits a slight increase in vulnerability to indirect prompt injection attacks.

Key takeaway

For CTOs and VPs of Engineering evaluating large language models for complex document processing or knowledge work, Claude Opus 4.6 presents a compelling option due to its one-million-token context window and superior benchmark performance in retrieval and knowledge tasks. However, you should carefully assess its increased vulnerability to indirect prompt injection, especially for agentic AI applications, and implement robust security measures.

Key insights

Claude Opus 4.6 offers a one-million-token context window with enhanced retrieval and knowledge work performance.

Principles

Larger context windows can degrade performance without mitigation.
Benchmarking provides a rough indicator of real-world performance.

Method

Anthropic addresses "context rot" with model improvements and a "Compaction" feature that summarizes older context automatically.

In practice

Use "Compaction" to manage large conversation histories.
Adjust the "effort parameter" for simpler tasks to reduce costs.
Explore "Agent Teams" for parallel AI agent task execution.

Topics

Claude Opus 4.6
Large Context Windows
LLM Benchmarks
Prompt Injection
API Integrations

Best for: CTO, VP of Engineering/Data, Director of AI/ML, AI Engineer, Machine Learning Engineer, Data Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by The Decoder.