SEMA-RAG: A Self-Evolving Multi-Agent Retrieval-Augmented Generation Framework for Medical Reasoning
Summary
SEMA-RAG, a Self-Evolving Multi-Agent Retrieval-Augmented Generation framework, is proposed for medical question answering to address limitations of static, single-round RAG in clinical reasoning. Standard RAG often suffers from poor question-to-query translation and a lack of iterative sufficiency feedback, leading to unreliable evidence chains. SEMA-RAG decouples these tasks into three specialized agents: an Interpreter Agent for clinical schema interpretation, an Explorer Agent for sufficiency-driven self-evolving retrieval, and an Arbiter Agent for evidence adjudication and answer selection. Evaluated across five medical benchmarks (MMLU-Med, MedQA-US, MedMCQA, PubMedQA*, BioASQ-Y/N) and five LLM backbones (deepseek-v3.1, kimi-k2, qwen3-coder-plus, gemini-2.0-flash, glm-4.0-flash), SEMA-RAG consistently improved the strongest baseline by an average of +6.46 accuracy points per backbone. The framework's multi-agent architecture and iterative evidence exploration are crucial for its performance gains.
Key takeaway
For AI Scientists and Machine Learning Engineers developing RAG systems for high-stakes domains like healthcare, consider adopting a multi-agent, self-evolving framework. Your current single-round RAG approaches may be insufficient for complex reasoning tasks, leading to suboptimal accuracy. Implementing task decoupling and iterative, sufficiency-driven retrieval can significantly enhance evidence chain reliability and overall system performance, as demonstrated by SEMA-RAG's +6.46 accuracy point improvement.
Key insights
Multi-agent RAG with self-evolving, sufficiency-driven retrieval significantly improves medical question answering accuracy.
Principles
- Decouple complex reasoning tasks into specialized roles.
- Iterative retrieval with sufficiency feedback enhances evidence chains.
- Clinically grounded semantic interpretation improves query quality.
Method
SEMA-RAG uses an Interpreter Agent to structure questions into a clinical schema, an Explorer Agent for self-evolving, multi-round retrieval based on evidence sufficiency, and an Arbiter Agent to adjudicate evidence and select answers.
In practice
- Implement role-based agents for complex RAG tasks.
- Use iterative query refinement to address evidence gaps.
- Structure input questions into explicit schemas for better retrieval.
Topics
- SEMA-RAG
- Multi-Agent RAG
- Medical Reasoning
- Retrieval-Augmented Generation
- Clinical Schema Interpretation
Best for: AI Scientist, Machine Learning Engineer, Research Scientist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by cs.CL updates on arXiv.org.