Awakening Dormant Experts:Counterfactual Routing to Mitigate MoE Hallucinations

· Source: Machine Learning · Field: Technology & Digital — Artificial Intelligence & Machine Learning · Depth: Expert, quick

Summary

Sparse Mixture-of-Experts (MoE) models, despite their scalability, are prone to hallucinations, especially with long-tail knowledge, due to static Top-$k$ routing that prioritizes high-frequency patterns. This causes "specialist experts" with crucial rare knowledge to remain "dormant" due to low gating scores. To combat this, Counterfactual Routing (CoR) is proposed as a training-free inference framework. CoR uses layer-wise perturbation analysis and the Counterfactual Expert Impact (CEI) metric to dynamically reallocate computational resources from syntax-dominant to knowledge-intensive layers, activating causally decisive experts through virtual ablation. Experiments on TruthfulQA, FACTOR, and TriviaQA show CoR improves factual accuracy by 3.1% on average without increasing the inference budget, outperforming static scaling.

Key takeaway

For AI Engineers deploying MoE models and struggling with factual hallucinations, Counterfactual Routing (CoR) offers a training-free solution to improve accuracy. You should consider integrating CoR to dynamically activate specialist experts, enhancing performance on long-tail knowledge without incurring additional inference costs. This approach provides a superior Pareto frontier compared to simply scaling up existing models.

Key insights

Static Top-$k$ routing in MoE models causes "dormant experts" and hallucinations; dynamic routing can activate them.

Principles

Method

Counterfactual Routing (CoR) uses layer-wise perturbation and Counterfactual Expert Impact (CEI) to dynamically shift resources from syntax-dominant to knowledge-intensive layers, activating dormant experts.

In practice

Topics

Best for: AI Engineer, NLP Engineer, Research Scientist, AI Scientist, Machine Learning Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Machine Learning.