The Agent Stack - Part 3: Control Planes, Sessions, and State Ownership
Summary
This post clarifies the critical distinction between "continuity" (user experience) and "control" (system's ability to resume without guessing) in agent systems, focusing on how systems manage state across pauses, retries, and corrections. It argues that a robust control plane is essential for resolving incoming events to a session, loading authoritative records, and managing valid continuation handles. The article emphasizes that a session serves as an isolation boundary for live interactions, not merely a history log, and highlights how the term "state" ambiguously refers to transcript state, working state, and memory. It illustrates these concepts using examples from OpenAI, LangGraph, and ADK, stressing that explicit architectural choices are necessary to define what constitutes an active run, authoritative state, and permissible continuations when work is interrupted.
Key takeaway
For AI Engineers and Architects designing agent systems, you must establish clear ownership and boundaries for session management, state, and authorization. Avoid common pitfalls like using user IDs as session keys or blurring transcript with model input. Your system's ability to handle interruptions, retries, and corrections coherently depends on explicit architectural decisions regarding authoritative state and continuation paths, ensuring robust operational behavior.
Key insights
Agent systems require a clear control plane to manage session state and ensure coherent execution across interruptions.
Principles
- Continuity is user experience; control is system resumability.
- A session is an isolation boundary, not just history.
- Model input is a prepared view, not the canonical record.
Method
The control plane resolves events to sessions, loads authoritative state, decides continuation handles, and provides a bounded state view to the runtime for each turn.
In practice
- Separate session identity from user identity.
- Explicitly name authoritative transcript and working-state stores.
- Define explicit retry, resume, and fork semantics.
Topics
- Agent Control Plane
- Session Management
- State Ownership
- Agent System Architecture
- Resume and Retry Semantics
Best for: AI Engineer, Machine Learning Engineer, AI Architect
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by The Agent Stack.