The Mirage of Performance Gains: Why Contrastive Decoding Fails to Mitigate Object Hallucinations in MLLMs?
Summary
A study reveals that widely used contrastive decoding strategies, including Visual Contrastive Decoding (VCD), Instruction Contrastive Decoding (ICD), and Self-Introspective Decoding (SID), fail to genuinely mitigate object hallucinations in Multimodal Large Language Models (MLLMs). The observed performance improvements on the POPE Benchmark are attributed to two misleading factors: a unidirectional adjustment of the model's output distribution, which biases responses towards "Yes" and artificially balances certain datasets, and the adaptive plausibility constraint, which effectively degrades sampling decoding strategies into greedy search. Experiments with LLaVA-v1.5-7B, LLaVA-v1.5-13B, and QwenVL-7B on MSCOCO and GQA datasets demonstrate that spurious methods, such as Prompt-Based Adjustment and Output Layer Modification, achieve comparable performance gains without addressing hallucinations, challenging common assumptions about contrastive decoding's effectiveness.
Key takeaway
For AI Scientists and Machine Learning Engineers evaluating MLLM hallucination mitigation, you should critically assess reported performance gains. Be aware that methods like contrastive decoding may achieve higher scores on benchmarks like POPE by merely biasing output distributions or degrading sampling to greedy search, rather than genuinely reducing hallucinations. Your evaluation criteria must extend beyond simple accuracy to ensure true mitigation, considering the impact of decoding strategies and unidirectional output shifts.
Key insights
Contrastive decoding's MLLM hallucination mitigation gains are spurious, driven by output bias and sampling degradation, not true reduction.
Principles
- Performance gains can mask true hallucination mitigation.
- Unidirectional output adjustments create false performance.
- Adaptive plausibility constraints can mimic greedy search.
Method
The paper introduces Prompt-Based Adjustment (adding "Answer Yes if possible" to prompts) and Output Layer Modification (forcing "Yes" if "Yes"/"No" probabilities are close) as spurious methods to demonstrate misleading performance gains.
In practice
- Scrutinize MLLM hallucination mitigation claims.
- Evaluate methods beyond benchmark accuracy.
- Consider decoding strategy impact on results.
Topics
- Multimodal Large Language Models
- Hallucination Mitigation
- Contrastive Decoding
- POPE Benchmark
- Decoding Strategies
- Performance Evaluation
Best for: AI Engineer, AI Scientist, Machine Learning Engineer, Research Scientist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.