AURA: Action-Gated Memory for Robot Policies at Constant VRAM
Summary
AURA-Mem (Action-Utility Recurrent Adaptive Memory) introduces a novel memory architecture designed for embodied AI agents operating on bandwidth-limited edge hardware, such as robots. This system addresses the inefficiencies of traditional KV-caches, which are optimized for datacenter inference with short, batched requests, but struggle with the long, non-resetting episodes typical of robotic tasks. AURA-Mem integrates a constant-size recurrent memory with a frozen vision-language-action backbone and a learned gate. This gate selectively writes to memory only when an observation would alter the subsequent action, trained directly against a closed-loop action-error signal. Its inference state remains fixed at 4,224 bytes, contrasting sharply with a KV-cache that can grow 6,061 times larger over 100,000 steps. Benchmarks show AURA-Mem matches O(1) baselines in accuracy while reducing memory writes by 5.19-6.13 times, and up to 9.19 times on simpler configurations. On the OpenVLA-OFT 7B panel on LIBERO-Long, it achieved a 0.233 success rate, matching ungated policies and slightly exceeding the 0.217 of an always-write KV arm, all while using 7.0 times fewer writes and maintaining constant memory.
Key takeaway
For Robotics Engineers developing long-running embodied AI agents on edge hardware, AURA-Mem offers a critical solution to VRAM and memory write limitations. You should consider integrating action-gated memory architectures to maintain constant inference state and drastically reduce write operations. This approach, demonstrated to match or exceed KV-cache performance while using 7.0 times fewer writes, allows for more efficient and sustainable deployment of complex policies without sacrificing success rates.
Key insights
AURA-Mem uses action-gated, constant-size memory for robots, significantly reducing writes and VRAM growth compared to KV-caches without sacrificing performance.
Principles
- Memory writes can be a binding constraint on edge hardware.
- Gate memory writes based on action-surprise signals.
- Constant memory size is crucial for long-horizon tasks.
Method
AURA-Mem wraps a frozen vision-language-action backbone with constant-size recurrent memory and a learned gate. This gate writes only when observations change the next action, trained via a closed-loop action-error signal.
In practice
- Implement action-gated memory for embodied agents.
- Prioritize memory write reduction in edge AI systems.
- Evaluate memory architectures for long-episode tasks.
Topics
- Robot Policies
- Edge AI Hardware
- Memory Architectures
- KV-Cache Optimization
- Recurrent Memory
- Embodied Agents
Best for: Research Scientist, AI Scientist, Robotics Engineer, AI Hardware Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.