HKVM-RAG: Key-Value-Separated Hypergraph Evidence Organization for Multi-Hop RAG
Summary
HKVM-RAG is a novel key-value-separated evidence-organization layer designed for multi-hop Retrieval-Augmented Generation (RAG) systems. It addresses the challenge of organizing retrieved text into explicit answer chains under fixed retrieval budgets. HKVM-RAG constructs answer-path hyperedges from cached passage-level LLM evidence tuples, using these as higher-order retrieval keys while retaining passage text as values. A controlled protocol isolates key-space design, showing that weighted hypergraph key-value retrieval improves over KG-PPR by +3.426 F1 on 2WikiMultiHopQA and +3.592 F1 on MuSiQue. While HotpotQA showed no standalone F1 gains, a dense-aware controller combining frozen ColBERTv2 and HKVM features achieved 88.846 F1 on 2WikiMultiHopQA, 65.073 F1 on MuSiQue, and 85.810 F1 on HotpotQA, improving over ColBERTv2 by +11.084, +6.763, and +5.966 F1, respectively. Code and data are available on GitHub and Hugging Face.
Key takeaway
For Machine Learning Engineers building multi-hop RAG systems, consider integrating HKVM-RAG's hypergraph evidence organization. Its key-value-separated approach significantly improves answer F1 on benchmarks like 2WikiMultiHopQA and MuSiQue when combined with dense retrievers like ColBERTv2. You should explore using its rank/score outputs as a control signal to reorder retrieved passages, enhancing overall system performance rather than replacing dense retrieval.
Key insights
HKVM-RAG uses hypergraph keys to organize multi-hop evidence, improving RAG performance by complementing dense retrievers.
Principles
- Multi-hop RAG requires explicit evidence organization.
- Hypergraph keys enhance multi-hop evidence structuring.
- Structured signals effectively complement dense retrieval.
Method
HKVM-RAG assembles answer-path hyperedges from LLM-extracted evidence tuples, weights them with confidence, diffuses query-seeded scores, and projects to passage-value scores. A dense-aware controller then reorders passages using combined rank/score features.
In practice
- Use DeepSeek V4 Flash for evidence tuple extraction.
- Combine hypergraph signals with ColBERTv2 for re-ranking.
- Isolate key-space design for controlled RAG experiments.
Topics
- Multi-hop RAG
- Hypergraph Neural Networks
- Evidence Organization
- Retrieval-Augmented Generation
- ColBERTv2
- Knowledge Graphs
Code references
Best for: Research Scientist, AI Architect, AI Engineer, AI Scientist, Machine Learning Engineer, NLP Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by cs.CL updates on arXiv.org.