Probing Multimodal Large Language Models on Cognitive Biases in Chinese Short-Video Misinformation

· Source: cs.CL updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics · Depth: Expert, extended

Summary

A new evaluation framework assesses Multimodal Large Language Models' (MLLMs) robustness against misinformation in Chinese short videos, using a manually annotated dataset of 200 videos from Douyin and Kuaishou. This dataset, covering four health domains, features fine-grained annotations for experimental errors, logical fallacies, and fabricated claims, verified by authoritative evidence. Eight frontier MLLMs, including Gemini-2.5-Pro, GPT-4o, and Qwen models, were evaluated across five input modalities. Gemini-2.5-Pro achieved the highest belief score of 71.5/100 in the multimodal setting, while o3 scored 35.2. The study revealed MLLMs are susceptible to cognitive biases: high video popularity surprisingly increased model confidence in their initial judgments, and authoritative channel IDs significantly influenced trust, often leading to lower performance on false content from verified sources. Multimodal inputs did not consistently outperform visual-only contexts.

Key takeaway

For AI Scientists and Machine Learning Engineers developing MLLMs for misinformation detection, recognize that current models are susceptible to cognitive biases like authority bias, where verified channels can deceptively influence trust. Your development efforts should focus on explicitly training models to be robust against such social cues and improving their ability to identify logical fallacies, as multimodal inputs alone do not guarantee superior performance. Prioritize bias mitigation and advanced reasoning over simply adding more modalities.

Key insights

MLLMs struggle with short-video misinformation due to cognitive biases like authority and popularity effects, despite multimodal reasoning capabilities.

Principles

Method

A framework evaluates MLLMs using a 200-video dataset with fine-grained error annotations (experimental errors, logical fallacies, fabricated claims) and a 7-level Likert scale for trustworthiness, calculating a normalized Belief Score (BS).

In practice

Topics

Code references

Best for: Research Scientist, AI Scientist, Machine Learning Engineer, AI Ethicist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.CL updates on arXiv.org.