MemPalace By Mila Jovovich: 96.6% Recall With Zero API Calls (Too Good To Be True?)

· Source: AI Advances - Medium · Field: Technology & Digital — Artificial Intelligence & Machine Learning · Depth: Intermediate, quick

Summary

MemPalace, a spatial memory system developed by Mila Jovovich, achieves a 96.6% R@5 recall rate on LongMemEval without using any API calls or LLMs in its retrieval loop, and it operates without subscription fees. This performance has been independently reproduced on third-party hardware. The system's core approach involves storing raw conversation text verbatim and leveraging modern embeddings for retrieval, avoiding summarization or distillation to preserve signal integrity. It employs a five-stage hybrid retrieval pipeline that combines keyword overlap, temporal boosting, and preference extraction with semantic search, reaching 99.4% R@5 accuracy for less than $0.001 per query. The architecture organizes millions of tokens into a navigable "palace" structure, functioning like a file system for AI conversation history.

Key takeaway

For NLP Engineers and AI Scientists building memory systems, consider adopting a verbatim text storage and hybrid retrieval approach. MemPalace's demonstrated 96.6% R@5 recall with zero API calls suggests that simpler, more direct methods can outperform complex LLM-dependent solutions. Evaluate integrating a multi-stage retrieval pipeline and spatial data organization into your projects to potentially achieve higher accuracy and lower operational costs.

Key insights

Verbatim text storage combined with advanced embeddings and hybrid retrieval outperforms complex AI memory systems.

Principles

Method

A five-stage hybrid retrieval pipeline combines keyword overlap, temporal boosting, preference extraction, and semantic search on verbatim text.

In practice

Topics

Best for: NLP Engineer, AI Scientist, Research Scientist, Machine Learning Engineer, AI Engineer, AI Architect

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by AI Advances - Medium.