Authority, Truth, and Citation Bias: A Large-Scale Multi-Domain Benchmark for Studying Epistemic Susceptibility in Large Language Models
Summary
AuthorityBench, a new 220,564-prompt multi-domain benchmark, reveals that citation presence significantly influences epistemic behavior in large language models (LLMs), often increasing hallucination rates. This benchmark employs a novel 2x2 factorial design, crossing claim veracity with citation veracity across general knowledge, science, law, and medicine domains, while controlling for 40 prompt templates, four venue prestige tiers, and author demographics. Evaluating seven models, the study found that both real and fabricated citations consistently elevate hallucination compared to a no-citation baseline. The effect is most pronounced when fabricated citations accompany true claims, boosting hallucination by 3 to 22 percentage points and reaching 35 to 77% in general knowledge, though legal claims demonstrate greater robustness. Venue prestige and author demographics had negligible impact.
Key takeaway
For machine learning engineers deploying LLMs in citation-augmented environments, you must implement robust fact-checking mechanisms, as this research demonstrates that citation presence, even fabricated ones, significantly increases hallucination rates. Do not assume citations guarantee factual accuracy; instead, prioritize independent verification, especially for general knowledge domains where susceptibility is highest. Your evaluation pipelines should account for this epistemic susceptibility.
Key insights
Large language models exhibit epistemic susceptibility, increasing hallucination rates when citations are present, regardless of their veracity.
Principles
- Citation presence consistently increases LLM hallucination.
- Fabricated citations with true claims amplify hallucination significantly.
- Legal domain claims show comparative robustness to citation bias.
Method
AuthorityBench uses a 2x2 factorial design, crossing claim and citation veracity across four domains, with controlled variation in prompt templates, venue prestige, and author names.
In practice
- Use AuthorityBench to evaluate LLM epistemic susceptibility.
- Prioritize fact-checking LLM outputs even with citations.
Topics
- Large Language Models
- Citation Bias
- LLM Hallucination
- Epistemic Susceptibility
- Benchmark Datasets
- Fact-Checking
Code references
Best for: Research Scientist, CTO, VP of Engineering/Data, AI Scientist, Machine Learning Engineer, AI Ethicist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Takara TLDR - Daily AI Papers.