Remember Me, Refine Me: A Dynamic Procedural Memory Framework for Experience-Driven Agent Evolution

· Source: cs.AI updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Robotics & Autonomous Systems · Depth: Expert, extended

Summary

The ReMe (Remember Me, Refine Me) framework introduces a dynamic procedural memory system for large language model (LLM) agents, addressing the limitations of static, append-only memory archives. Developed by researchers from Shanghai Jiao Tong University and Alibaba Group's Tongyi Lab, ReMe enables experience-driven agent evolution through three core mechanisms: multi-faceted distillation for extracting fine-grained experiences, context-adaptive reuse via scenario-aware indexing, and utility-based refinement for autonomously managing a high-quality experience pool. Experiments on BFCL-V3 and AppWorld benchmarks demonstrate ReMe's state-of-the-art performance, with Qwen3-8B equipped with ReMe outperforming a larger, memoryless Qwen3-14B model, achieving average gains of 8.83% in Avg@4 and 7.29% in Pass@4. This suggests that self-evolving memory offers a computation-efficient pathway for lifelong learning in LLM agents. The code and reme.library dataset are publicly released.

Key takeaway

For NLP Engineers and Research Scientists developing LLM agents, ReMe demonstrates that investing in dynamic memory management can yield significant performance improvements and computational efficiencies. You should consider integrating multi-faceted distillation and utility-based refinement into your agent architectures to enable self-evolving learning, potentially allowing smaller models to achieve or exceed the performance of larger, memory-intensive baselines. This approach offers a pathway to more robust and resource-efficient agent development.

Key insights

ReMe dynamically manages LLM agent procedural memory, enabling smaller models to surpass larger, memoryless counterparts through experience-driven evolution.

Principles

Method

ReMe employs multi-faceted distillation for experience extraction, context-adaptive reuse via scenario-aware indexing, and utility-based refinement for dynamic memory pool optimization.

In practice

Topics

Code references

Best for: NLP Engineer, Research Scientist, AI Scientist, Machine Learning Engineer, AI Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.