Probing Multimodal Large Language Models on Cognitive Biases in Chinese Short-Video Misinformation
Summary
A new evaluation framework assesses Multimodal Large Language Models' (MLLMs) robustness against misinformation in Chinese short videos, using a manually annotated dataset of 200 videos from Douyin and Kuaishou. This dataset, covering four health domains, features fine-grained annotations for experimental errors, logical fallacies, and fabricated claims, verified by authoritative evidence. Eight frontier MLLMs, including Gemini-2.5-Pro, GPT-4o, and Qwen models, were evaluated across five input modalities. Gemini-2.5-Pro achieved the highest belief score of 71.5/100 in the multimodal setting, while o3 scored 35.2. The study revealed MLLMs are susceptible to cognitive biases: high video popularity surprisingly increased model confidence in their initial judgments, and authoritative channel IDs significantly influenced trust, often leading to lower performance on false content from verified sources. Multimodal inputs did not consistently outperform visual-only contexts.
Key takeaway
For AI Scientists and Machine Learning Engineers developing MLLMs for misinformation detection, recognize that current models are susceptible to cognitive biases like authority bias, where verified channels can deceptively influence trust. Your development efforts should focus on explicitly training models to be robust against such social cues and improving their ability to identify logical fallacies, as multimodal inputs alone do not guarantee superior performance. Prioritize bias mitigation and advanced reasoning over simply adding more modalities.
Key insights
MLLMs struggle with short-video misinformation due to cognitive biases like authority and popularity effects, despite multimodal reasoning capabilities.
Principles
- MLLMs exhibit authority bias, trusting verified channels regardless of content.
- High video popularity can reinforce MLLM's initial judgments, not necessarily induce false trust.
- Logical fallacies are particularly challenging for MLLMs to identify in videos.
Method
A framework evaluates MLLMs using a 200-video dataset with fine-grained error annotations (experimental errors, logical fallacies, fabricated claims) and a 7-level Likert scale for trustworthiness, calculating a normalized Belief Score (BS).
In practice
- Prioritize visual modality for short-video misinformation detection.
- Scrutinize MLLM outputs for biases related to source authority.
- Focus on improving MLLM's logical fallacy detection capabilities.
Topics
- Multimodal Large Language Models
- Misinformation Detection
- Cognitive Biases
- Short-Video Platforms
- Authority Bias
- Fact-Checking
Code references
Best for: Research Scientist, AI Scientist, Machine Learning Engineer, AI Ethicist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by cs.CL updates on arXiv.org.