Does RAG Know When Retrieval Is Wrong? Diagnosing Context Compliance under Knowledge Conflict
Summary
A new study introduces Context-Driven Decomposition (CDD), an inference-time belief-decomposition probe designed to diagnose how Retrieval-Augmented Generation (RAG) models handle knowledge conflicts where retrieved context contradicts the model's parametric knowledge. CDD acts as an intervention mechanism for controlled retrieval conflict, revealing three key patterns across Epi-Scale stress tests, TruthfulQA misconception injection, and cross-model reruns. First, context compliance is measurable, with Standard RAG achieving only 15.0% accuracy on TruthfulQA misconception injection (N=500). Second, adversarial accuracy gains from CDD transfer across model families like Gemini-2.5-Flash and Claude Haiku/Sonnet/Opus, though the rationale-answer causal coupling does not, indicating different underlying mechanisms. Third, explicit conflict decomposition improves robustness against temporal drift and noisy distractors, with CDD reaching 71.3% on temporal shifts and 69.9% on distractor evidence on the full Epi-Scale benchmark. These findings establish context compliance as a structural axis for RAG probing, distinct from retrieval quality, and motivate the release of Epi-Scale for further systematic study.
Key takeaway
For AI Architects and Research Scientists evaluating RAG system robustness, this research highlights that standard RAG struggles significantly when retrieved context conflicts with internal knowledge, achieving only 15% accuracy in some adversarial settings. You should integrate tools like Context-Driven Decomposition (CDD) and benchmarks like Epi-Scale into your evaluation pipelines to systematically diagnose and improve context compliance, especially when deploying RAG in dynamic or adversarial environments where factual consistency is paramount. This approach helps ensure your RAG systems maintain accuracy under knowledge conflict.
Key insights
Context-Driven Decomposition (CDD) diagnoses how RAG models handle conflicting information between retrieved context and parametric knowledge.
Principles
- Context compliance is a measurable RAG vulnerability.
- Adversarial accuracy gains can transfer across model families.
- Explicit conflict decomposition enhances RAG robustness.
Method
Context-Driven Decomposition (CDD) is an inference-time belief-decomposition probe that intervenes on RAG models to diagnose context compliance under controlled retrieval conflict scenarios.
In practice
- Use CDD to measure RAG context compliance.
- Apply CDD for improved robustness against temporal drift.
- Evaluate RAG systems with Epi-Scale benchmark.
Topics
- Retrieval-Augmented Generation
- Context Compliance
- Knowledge Conflict
- Context-Driven Decomposition
- Epi-Scale Benchmark
Code references
Best for: AI Architect, AI Engineer, Research Scientist, AI Scientist, Machine Learning Engineer, NLP Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Takara TLDR - Daily AI Papers.