HKVM-RAG: Key-Value-Separated Hypergraph Evidence Organization for Multi-Hop RAG

· Source: cs.CL updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics · Depth: Expert, extended

Summary

HKVM-RAG is a novel key-value-separated evidence-organization layer designed for multi-hop Retrieval-Augmented Generation (RAG) systems. It addresses the challenge of organizing retrieved text into explicit answer chains under fixed retrieval budgets. HKVM-RAG constructs answer-path hyperedges from cached passage-level LLM evidence tuples, using these as higher-order retrieval keys while retaining passage text as values. A controlled protocol isolates key-space design, showing that weighted hypergraph key-value retrieval improves over KG-PPR by +3.426 F1 on 2WikiMultiHopQA and +3.592 F1 on MuSiQue. While HotpotQA showed no standalone F1 gains, a dense-aware controller combining frozen ColBERTv2 and HKVM features achieved 88.846 F1 on 2WikiMultiHopQA, 65.073 F1 on MuSiQue, and 85.810 F1 on HotpotQA, improving over ColBERTv2 by +11.084, +6.763, and +5.966 F1, respectively. Code and data are available on GitHub and Hugging Face.

Key takeaway

For Machine Learning Engineers building multi-hop RAG systems, consider integrating HKVM-RAG's hypergraph evidence organization. Its key-value-separated approach significantly improves answer F1 on benchmarks like 2WikiMultiHopQA and MuSiQue when combined with dense retrievers like ColBERTv2. You should explore using its rank/score outputs as a control signal to reorder retrieved passages, enhancing overall system performance rather than replacing dense retrieval.

Key insights

HKVM-RAG uses hypergraph keys to organize multi-hop evidence, improving RAG performance by complementing dense retrievers.

Principles

Method

HKVM-RAG assembles answer-path hyperedges from LLM-extracted evidence tuples, weights them with confidence, diffuses query-seeded scores, and projects to passage-value scores. A dense-aware controller then reorders passages using combined rank/score features.

In practice

Topics

Code references

Best for: Research Scientist, AI Architect, AI Engineer, AI Scientist, Machine Learning Engineer, NLP Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.CL updates on arXiv.org.