Can LLMs Replace Survey Respondents?

· Source: Towards Data Science · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Economic Analysis & Policy · Depth: Expert, medium

Summary

Large language models (LLMs) can replicate the median responses of household surveys, such as the Survey of Consumer Expectations (SCE), for inflation expectations to within a percentage point (e.g., 3% in 2020). However, LLMs suffer from "mode collapse," where 95% of simulated respondents cluster within a two-percentage-point window, failing to capture the wide dispersion of real human responses (e.g., -25% to +27% in 2020 SCE). This issue, likely caused by LLMs retrieving memorized statistics, is not resolved by standard prompting. Researchers applied unlearning methods, Gradient Ascent (GA) and Negative Preference Optimization (NPO), to Llama-3.1-8B-Instruct. GA achieved 97% tail accuracy against the SCE benchmark of 44% (responses >3pp from mode), and NPO achieved 98%, significantly improving distributional realism. In replicating a randomized controlled trial (RCT), Llama-GA successfully reproduced treatment effects and dispersion, unlike baseline or NPO models, suggesting unlearning enables more realistic synthetic agent behavior for complex simulations.

Key takeaway

For AI Scientists and Machine Learning Engineers developing synthetic survey agents, you must prioritize distributional accuracy over just matching averages. Standard LLMs exhibit "mode collapse," making them unsuitable for replicating heterogeneous populations or complex randomized controlled trials. You should explore unlearning methods like Gradient Ascent to remove memorized statistics. This enables models to generate more realistic response distributions and accurately simulate treatment effects. Always report second moments to validate your synthetic populations.

Key insights

LLMs can match survey means but fail to reproduce population dispersion due to "mode collapse" from memorized training data.

Principles

Method

Apply Gradient Ascent (GA) or Negative Preference Optimization (NPO) to unlearn specific data, maximizing prediction loss on forget sets while retaining general capabilities.

In practice

Topics

Best for: AI Scientist, Machine Learning Engineer, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Towards Data Science.