Elastic Open-Sources Atlas Agent Memory Based on Cognitive Science
Summary
Elastic has open-sourced Atlas, an Elasticsearch-based system designed to provide long-term memory for AI agents interacting with users over extended periods. Released on June 30, 2026, Atlas addresses the scalability and "lost in the middle" issues associated with stuffing entire interaction histories into LLM context windows. The system implements three cognitive science-inspired memory types—episodic, semantic, and procedural—each managed in separate Elasticsearch indices. Episodic memories capture events, semantic memories store consolidated facts identified by an LLM, and procedural memories track "playbooks" with success/failure metrics. Agents access these memories via a hybrid query combining BM25 lexical search and Jina v5 semantic search, re-ranked by a cross-encoder, achieving 0.89 Recall@10 in question-answering. Document-level security ensures per-user memory isolation.
Key takeaway
For AI Engineers designing conversational agents that require persistent, scalable memory across long user interactions, Elastic's open-sourced Atlas offers a robust architectural blueprint. You should evaluate its cognitive science-inspired three-memory-type approach and hybrid search strategy to overcome context window limitations and "lost in the middle" effects. Consider adopting its Elasticsearch-based design for managing diverse memory types and ensuring per-user data isolation, potentially saving significant development time.
Key insights
Atlas provides scalable, long-term agent memory by categorizing information into episodic, semantic, and procedural types based on cognitive science.
Principles
- Agent long-term memory benefits from distinct episodic, semantic, and procedural stores.
- LLMs can consolidate transient episodic events into durable semantic facts and procedural playbooks.
- Hybrid search combining lexical and semantic methods improves memory retrieval accuracy.
Method
Atlas stores user inputs as episodic events, uses an LLM to consolidate these into semantic facts and procedural playbooks, and retrieves memories via a hybrid RRF query over BM25 and Jina v5, re-ranked by a cross-encoder.
In practice
- Implement per-user memory isolation using document-level security.
- Employ Reciprocal Rank Fusion for combining diverse search results.
- Consider Elasticsearch for scalable, multi-modal memory storage with advanced features.
Topics
- Atlas
- Agent Memory
- Elasticsearch
- Large Language Models
- Reciprocal Rank Fusion
- Cognitive Science
Code references
Best for: AI Engineer, Machine Learning Engineer, AI Architect
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by InfoQ.