Yet another experiment proves it's too damn simple to poison large language models
Summary
A security engineer, Ron Stoner, demonstrated a simple yet effective method of poisoning large language models (LLMs) by fabricating a non-existent 6 Nimmt! card game championship. Stoner registered a $12 domain, 6nimmt.com, and created a Wikipedia entry listing himself as the 2025 world champion, citing his own domain as the source. This single, uncorroborated source was sufficient to convince several AI chatbots with web search capabilities to confidently report him as the champion. The experiment highlights three failure modes: immediate retrieval layer poisoning, potential inclusion of false data in model training corpora if the Wikipedia edit persisted, and the risk of malicious actions by AI agents if poisoned sources dictate their tool access. Stoner's goal was to spur discussion on LLM trust, source provenance, and the need for improved heuristic filtering of suspicious web content.
Key takeaway
For CTOs and VPs of Engineering overseeing AI/ML initiatives, your teams must prioritize robust data provenance and implement sophisticated heuristic filtering for retrieval-augmented generation (RAG) systems. Relying solely on LLMs to "figure out" source trustworthiness is a critical vulnerability. You should invest in mechanisms to detect suspicious patterns, such as single citations to newly registered domains, to mitigate the risk of both reputational damage from misinformation and potential security threats from poisoned AI agents.
Key insights
LLMs with web search are vulnerable to simple, low-cost data poisoning via fabricated online sources.
Principles
- LLMs prioritize highly ranked retrieval results.
- AI models struggle to discern source provenance.
- Corpus poisoning is a persistent cleanup problem.
Method
Register a domain, create a Wikipedia entry citing the domain, and allow LLMs with web search to retrieve the fabricated information.
In practice
- Implement strong data provenance checks.
- Heuristically filter recent, single-source web content.
- Educate users on RAG data pipeline limitations.
Topics
- Large Language Models
- Retrieval-Augmented Generation
- Data Poisoning
- Misinformation Tactics
- AI Agents
Best for: CTO, VP of Engineering/Data, Director of AI/ML, AI Security Engineer, MLOps Engineer, AI Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by The Register: Enterprise Technology News and Analysis.