Scaling Managed Agents: Decoupling the brain from the hands

· Source: Anthropic Engineering Blog · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Robotics & Autonomous Systems, Cloud Computing & IT Infrastructure · Depth: Advanced, long

Summary

Anthropic's Managed Agents, a hosted service within the Claude Platform, addresses the challenge of designing long-horizon AI agents whose underlying model capabilities evolve rapidly. Published on April 8, 2026, this system decouples the "brain" (Claude and its harness) from the "hands" (sandboxes and tools) and the "session" (event log). This architecture virtualizes agent components like session, harness, and sandbox into stable interfaces, allowing their implementations to change independently without breaking the overall system. This approach resolves issues like "context anxiety" in older models and the "pets-vs-cattle" problem of tightly coupled infrastructure, where a single container failure could lead to lost sessions and difficult debugging. By making harnesses and containers stateless and interchangeable, Managed Agents significantly reduces time-to-first-token (TTFT) by approximately 60% at p50 and over 90% at p95, enabling efficient scaling and enhanced security.

Key takeaway

For engineering leaders building long-horizon AI agents, your teams should adopt decoupled architectures like Anthropic's Managed Agents to future-proof against rapidly evolving LLM capabilities. This approach ensures your agent infrastructure remains stable and scalable, preventing costly re-engineering as models improve and significantly reducing latency. Prioritize designing stable interfaces for agent components to avoid tight coupling and enable independent evolution and robust failure recovery.

Key insights

Decoupling AI agent components into stable interfaces enables adaptability to evolving models and scalable, resilient operations.

Principles

Method

Managed Agents virtualizes agent components (session, harness, sandbox) into independent interfaces. The harness calls sandboxes as tools, and the session log stores context durably outside the harness, allowing independent failure recovery and scaling.

In practice

Topics

Best for: CTO, VP of Engineering/Data, Director of AI/ML, AI Architect, MLOps Engineer, AI Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Anthropic Engineering Blog.