Hallucinated citations are polluting the scientific literature. What can be done?

· Source: Machine learning : nature.com subject feeds · Field: Science & Research — Research Methodology & Innovation, Artificial Intelligence & Machine Learning · Depth: Intermediate, medium

Summary

A growing problem of AI-hallucinated citations is polluting academic literature, with analyses showing a sharp increase in non-existent references in scholarly papers. Computer scientist Guillaume Cabanac discovered a fabricated citation of his work in the *International Dental Journal*, prompting concerns about AI-generated inaccuracies. Surveys indicate researchers increasingly use large language models (LLMs) for literature searches and bibliography formatting, leading to these fabricated references. One analysis of nearly 18,000 computer science papers found 2.6% in 2025 had at least one potentially hallucinated citation, up from 0.3% in 2024. *Nature*'s news team, in collaboration with Grounded AI, estimates tens of thousands of 2025 publications, including journal papers and books, likely contain invalid AI-generated references. Publishers like Frontiers and Taylor & Francis are developing or exploring AI tools to screen submissions, with some editors rejecting a significant percentage of papers due to fake references.

Key takeaway

For research integrity officers and journal editors, the surge in AI-hallucinated citations necessitates immediate action. You should implement robust screening processes, potentially integrating AI-powered tools like Grounded AI's Veracity, and conduct manual verification for flagged references. This proactive approach is crucial to maintain the credibility of published research and prevent a "flood of fake references" from undermining academic trust.

Key insights

AI-generated hallucinated citations are rapidly increasing, polluting academic literature with fabricated and inaccurate references.

Principles

Method

Grounded AI's Veracity tool checks citations against scholarly databases, flagging invalid, irrelevant, or retracted work, assigning a risk score based on major issues and AI generation likelihood.

In practice

Topics

Best for: CTO, VP of Engineering/Data, Director of AI/ML, AI Scientist, Research Scientist, Policy Maker

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Machine learning : nature.com subject feeds.