Control-Plane Placement Shapes Forgetting: An Architectural Study of Agent Memory Across Thirteen System Configurations
Summary
An architectural study comparing thirteen agent memory system configurations reveals that control-plane LLM placement significantly shapes forgetting recovery. The research identifies three placement regimes: deterministic primitives handle lexical/temporal categories but fail canonicalization (5% on identifier-obfuscation, 0% on cross-lingual); inscribe-time LLMs recover canonicalization (100%) but not intent-aware deletion (0% on prefix-collision/compound-fact); and a mutation-time hook recovers intent-aware deletion (78-85%) and boosts overall recovery to 91.7-93.2% at \$0.17 per 385-case run and 2.3s/case latency. The study introduces ForgetEval, a 1000-case templated suite with a 385-case adversarial layer, and a six-method Adapter Protocol, both released under MIT, to benchmark these failures.
Key takeaway
For AI Architects designing agent memory systems, understanding LLM placement's impact on forgetting is crucial. You should prioritize integrating mutation-time LLM hooks within the control plane, as this configuration significantly improves recovery from canonicalization and intent-aware deletion failures, achieving 91.7-93.2% overall. Additionally, utilize the ForgetEval benchmark to rigorously test your system's resilience against diverse forgetting scenarios.
Key insights
LLM placement within an agent's memory control plane critically determines recovery from various forgetting failure modes.
Principles
- Production failures are predominantly forgetting failures, not recall failures.
- Deterministic memory primitives struggle with canonicalization.
- Mutation-time LLM hooks offer superior, broad-spectrum forgetting recovery.
Method
The ForgetEval suite, comprising a 1000-case templated suite and a 385-case adversarial layer, uses deterministic substring matching and a six-method Adapter Protocol for heterogeneous memory store evaluation.
In practice
- Integrate mutation-time LLM hooks for robust agent memory.
- Utilize ForgetEval to benchmark forgetting capabilities.
- Adopt the Adapter Protocol for diverse memory store integration.
Topics
- Agent Memory
- LLM Placement
- Forgetting Failures
- Control Plane
- ForgetEval
- Memory Benchmarking
Best for: Research Scientist, AI Engineer, AI Product Manager, AI Scientist, Machine Learning Engineer, AI Architect
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.