Anthropic's new Claude Sonnet 5 closes the gap to the pricier Opus model series

· Source: The Decoder · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Robotics & Autonomous Systems, Emerging Technologies & Innovation · Depth: Intermediate, medium

Summary

Anthropic has released Claude Sonnet 5, its most agentic Sonnet model to date, capable of building plans and utilizing tools like browsers and terminals. Benchmarks show Sonnet 5 significantly outperforms its predecessor, Sonnet 4.6, across all tested categories, including agentic coding (63.2% on SWE-bench Pro) and multidisciplinary reasoning (57.4% on Humanity's Last Exam with tools). Notably, Sonnet 5 nearly matches the pricier Opus 4.8 in several areas and even surpasses it on real-world knowledge work, scoring 1,618 points on GDPval-AA v2 compared to Opus 4.8's 1,615. The model is available now on all Anthropic platforms with a one-million-token context window and an introductory price of \$2 per million input tokens and \$10 per million output tokens until August 31, 2026. Anthropic also states Sonnet 5 has low cybersecurity risk and improved safety features.

Key takeaway

For AI Engineers evaluating new large language models, Claude Sonnet 5 offers a compelling balance of enhanced agentic capabilities and performance, often rivaling Opus 4.8, at a more accessible price point. You should consider integrating Sonnet 5 for tasks requiring complex planning or tool use, especially given its superior performance on knowledge work benchmarks. Be mindful that increased agentic behavior might lead to higher token consumption, potentially impacting overall operational costs after the introductory pricing period ends in August 2026.

Key insights

Sonnet 5 significantly boosts agentic capabilities and performance, closing the gap to Opus models while maintaining low cybersecurity risk.

Principles

In practice

Topics

Best for: CTO, VP of Engineering/Data, AI Architect, AI Engineer, Machine Learning Engineer, Director of AI/ML

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by The Decoder.