MemPro: Agentic Memory Systems as Evolvable Programs

· Source: Artificial Intelligence · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Robotics & Autonomous Systems, Software Development & Engineering · Depth: Expert, quick

Summary

MemPro introduces a system-level evolution framework designed to overcome limitations in existing agentic memory systems for long-horizon autonomous agents. Current memory construction-retrieval (MCR) pipelines often remain fixed post-deployment, struggling with diverse task-specific failures and evolving memory bank structures. MemPro addresses this by treating the entire MCR pipeline as an evolvable program, maintaining a version tree of implementations. An Evolving Agent iteratively selects promising versions, diagnoses recurring failures, and refines them into improved child versions. Experiments on LongMemEval, LoCoMo, HotpotQA, and NarrativeQA demonstrate that MemPro consistently outperforms static and prompt-level evolving baselines within a few iterations, shows continuous improvement, and achieves a favorable performance-cost trade-off. Code is publicly available.

Key takeaway

For AI Architects designing long-horizon autonomous agents, relying on fixed memory construction-retrieval pipelines will likely lead to performance bottlenecks and misalignment with evolving memory banks. You should consider implementing evolvable memory systems like MemPro, which dynamically adapt the entire pipeline through iterative diagnosis and refinement. This approach promises superior, continuously improving performance and better cost efficiency compared to static or prompt-level evolving baselines.

Key insights

Evolving the entire memory construction-retrieval pipeline as a program, not just the memory bank, significantly enhances long-horizon autonomous agents.

Principles

Method

MemPro maintains a version tree of runnable memory-system implementations. An Evolving Agent iteratively selects versions, diagnoses recurring failures, and creates improved child versions through failure-mode-guided edit-debug refinement.

In practice

Topics

Code references

Best for: Research Scientist, AI Engineer, NLP Engineer, AI Scientist, Machine Learning Engineer, AI Architect

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.