Polarization by Default: Auditing Recommendation Bias in LLM-Based Content Curation
Summary
A controlled simulation study investigated content selection biases in Large Language Models (LLMs) used for content curation and ranking. Researchers mapped these biases across three major LLM providers (OpenAI, Anthropic, Google) using real social media datasets from Twitter/X, Bluesky, and Reddit. The study employed six prompting strategies: general, popular, engaging, informative, controversial, and neutral. Through 540,000 simulated top-10 selections from pools of 100 posts across 54 experimental conditions, the findings indicate that biases vary significantly in their structural nature and prompt sensitivity. Polarization was amplified across all configurations, toxicity handling showed an inversion between engagement- and information-focused prompts, and sentiment biases were predominantly negative. GPT-4o Mini exhibited the most consistent behavior, while Claude and Gemini showed high adaptivity in toxicity handling. Gemini also displayed the strongest negative sentiment preference. On Twitter/X, left-leaning authors were systematically over-represented despite forming a minority in the dataset, a pattern that largely persisted across prompts.
Key takeaway
For engineering teams deploying LLMs for content curation, you must actively audit and mitigate inherent biases. Be aware that polarization is amplified by default, and negative sentiment is common. Consider provider-specific trade-offs, such as GPT-4o Mini's consistency versus Claude and Gemini's toxicity adaptivity, and implement diverse prompting strategies to counter specific biases like political leaning over-representation.
Key insights
LLM content curation amplifies polarization and exhibits consistent negative sentiment, with varying provider-specific biases.
Principles
- LLM biases are both structural and prompt-sensitive.
- Polarization is a default outcome in LLM content curation.
- Sentiment bias is predominantly negative across LLM providers.
Method
The study used a controlled simulation with 540,000 top-10 selections from 100-post pools, testing three LLM providers and six prompting strategies on real social media data.
In practice
- GPT-4o Mini offers consistent LLM behavior.
- Claude and Gemini adapt well to toxicity handling.
- Prompt design influences toxicity and sentiment biases.
Topics
- LLM Content Curation
- Recommendation Bias
- Social Media Datasets
- Prompt Engineering
- Polarization Amplification
Best for: Research Scientist, CTO, VP of Engineering/Data, AI Scientist, AI Ethicist, MLOps Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.