Awakening Dormant Experts:Counterfactual Routing to Mitigate MoE Hallucinations
Summary
Sparse Mixture-of-Experts (MoE) models, despite their scalability, are prone to hallucinations, especially with long-tail knowledge, due to static Top-$k$ routing that prioritizes high-frequency patterns. This causes "specialist experts" with crucial rare knowledge to remain "dormant" due to low gating scores. To combat this, Counterfactual Routing (CoR) is proposed as a training-free inference framework. CoR uses layer-wise perturbation analysis and the Counterfactual Expert Impact (CEI) metric to dynamically reallocate computational resources from syntax-dominant to knowledge-intensive layers, activating causally decisive experts through virtual ablation. Experiments on TruthfulQA, FACTOR, and TriviaQA show CoR improves factual accuracy by 3.1% on average without increasing the inference budget, outperforming static scaling.
Key takeaway
For AI Engineers deploying MoE models and struggling with factual hallucinations, Counterfactual Routing (CoR) offers a training-free solution to improve accuracy. You should consider integrating CoR to dynamically activate specialist experts, enhancing performance on long-tail knowledge without incurring additional inference costs. This approach provides a superior Pareto frontier compared to simply scaling up existing models.
Key insights
Static Top-$k$ routing in MoE models causes "dormant experts" and hallucinations; dynamic routing can activate them.
Principles
- Static routing favors frequent patterns.
- Long-tail knowledge requires specialist experts.
Method
Counterfactual Routing (CoR) uses layer-wise perturbation and Counterfactual Expert Impact (CEI) to dynamically shift resources from syntax-dominant to knowledge-intensive layers, activating dormant experts.
In practice
- Apply CoR to MoE models for factual accuracy.
- Use CoR without increasing inference budget.
Topics
- Mixture-of-Experts
- Counterfactual Routing
- MoE Hallucinations
- Counterfactual Expert Impact
- Factual Accuracy
Best for: AI Engineer, NLP Engineer, Research Scientist, AI Scientist, Machine Learning Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Machine Learning.