PEEK: Context Map as an Orientation Cache for Long-Context LLM Agents
Summary
PEEK is a novel system designed to enhance large language model (LLM) agents operating with long and recurring external contexts, such as document corpora or code repositories. Unlike existing methods that preserve agent trajectories or raw material access, PEEK focuses on caching and maintaining "orientation knowledge" about the context itself. This knowledge, including context organization, useful entities, and schemas, is stored as a small, constant-sized "context map" within the agent's prompt. PEEK's maintenance policy comprises a Distiller for extracting transferable knowledge, a Cartographer for structured edits, and a priority-based Evictor for token budget enforcement. Benchmarking shows PEEK improves long-context reasoning and information aggregation by 6.3-34.0%, reducing iterations by 93-145 and costs by 1.7-5.8x compared to ACE, a prominent prompt-learning framework. For context learning, it boosts solving rate and rubric accuracy by 6.0-14.0% and 7.8-12.1%, respectively, at 1.4x lower cost than ACE, demonstrating broad applicability across LMs like OpenAI Codex.
Key takeaway
For Machine Learning Engineers deploying LLM agents in environments with recurring long external contexts, you should re-evaluate traditional context management strategies. PEEK demonstrates that caching "orientation knowledge" via a dynamic context map significantly improves agent accuracy and reduces operational costs by 1.7-5.8x. Consider integrating a similar context map approach to enhance your agents' efficiency and performance, especially for tasks involving document corpora or code repositories. This method offers substantial gains over prompt-learning frameworks like ACE.
Key insights
Caching context orientation knowledge improves LLM agent efficiency and accuracy in recurring long-context tasks.
Principles
- Reusable orientation knowledge is crucial for recurring long-context workloads.
- A small, persistent context map can guide LLM agents.
- Programmable cache policies can maintain dynamic context knowledge.
Method
PEEK's cache policy uses a Distiller to extract knowledge, a Cartographer to translate it into structured edits for the context map, and an Evictor to manage a fixed token budget.
In practice
- Implement a context map for LLM agents handling document corpora.
- Use a programmable cache policy to dynamically update agent context.
- Apply PEEK's modules (Distiller, Cartographer, Evictor) for knowledge management.
Topics
- LLM Agents
- Long-Context LLMs
- Context Map
- Cache Policy
- Orientation Knowledge
- OpenAI Codex
Best for: Research Scientist, AI Engineer, AI Scientist, Machine Learning Engineer, NLP Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.