Chatbots Keep Telling Stories About Lighthouse Keeper 'Elias Thorne'. We Might Know Why

· Source: 404media Feed · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Emerging Technologies & Innovation · Depth: Intermediate, medium

Summary

Large language models, including ChatGPT, Gemini, Claude, Grok, and Deepseek, consistently generate stories featuring a character named "Elias Thorne", often as a lighthouse keeper or clockmaker. This phenomenon, first observed by software engineer Daniel May in early 2026, has led to a surge in Google Trends searches for "Elias Thorne" and "lighthouse keeper." Cornell University researchers Sil Hamilton and David Mimno, in their May 2026 paper "Elias in the Lighthouse, Again?", sampled 20,000 stories from multiple LLMs and discovered that 11 specific words, including "Elias" and "lighthouse keeper," appear in over 88% of outputs. They attribute this to model safety and alignment tuning, suggesting a "bottleneck" effect where safe, repetitive themes from foundational training datasets like WildChat (derived from GPT-3.5 and containing 166 "Elias" conversations) are inadvertently propagated across models and subsequent datasets. This "Elias Thorne" narrative has since proliferated beyond chatbots into Amazon's self-published AI-generated books, YouTube "slop" content, and fake news sites, often depicting him as a tragic figure.

Key takeaway

For tech journalists and content creators evaluating AI-generated material, understanding the "Elias Thorne" phenomenon is crucial. You should recognize that LLMs can inadvertently propagate specific narrative tropes and fictional personas across platforms. This stems from training data lineage and safety alignment. Rigorous verification of AI-sourced information and author identities is essential. This prevents the spread of repetitive or misleading content, especially in self-published works.

Key insights

LLMs propagate specific narrative patterns like "Elias Thorne" due to training data lineage and safety alignment, leading to widespread, repetitive outputs.

Principles

Method

Researchers sampled 20,000 stories from OpenAI's ChatGPT, Anthropic's Claude, Google's Gemini, and the Allen Institute for AI's chatbot using five prompts to identify common narrative elements.

In practice

Topics

Best for: Research Scientist, CTO, VP of Engineering/Data, AI Scientist, Machine Learning Engineer, Tech Journalist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by 404media Feed.