Can Large Language Models Revolutionize Survey Research? Experiments with Disaster Preparedness Responses
Summary
A study evaluates a five-stage framework for integrating large language models (LLMs) into survey research, using the 2024 Hurricane Milton preparedness survey of 946 Florida residents as a testbed. The framework covers questionnaire design, sample selection, pilot testing, missing-data imputation, and post-collection analysis. Researchers introduced a Protection Motivation Theory (PMT)-constrained co-occurrence knowledge graph to develop seven LLM configurations. Their proposed Anchored Marginal Theory-Informed LLM (A-TLM) significantly outperformed three classical imputation baselines (IPW/MI, MICE+PMM, missForest) on root-mean-square error (RMSE) under severe block-wise missing-not-at-random conditions (S4 RMSE 1.439 vs. 1.496 for the next-best method). A-TLM also achieved near-zero overall signed bias (-0.121), contrasting with the random-forest imputer's larger absolute bias (-0.631). The study highlights that near-zero aggregate bias can mask substantial opposing subgroup errors, particularly for compound-vulnerable respondents, and proposes subgroup-stratified bias auditing. A retrieval-constrained knowledge-graph chatbot demonstrated architectural management of hallucination risk.
Key takeaway
For Data Scientists and Research Scientists working with survey data, you should integrate theory-constrained LLMs into your imputation workflows. This approach, particularly the A-TLM, significantly reduces bias for compound-vulnerable subgroups under severe missingness. You must adopt subgroup-stratified bias reporting as a standard practice for policy-relevant applications, avoiding masked critical errors. Consider using LLM-augmented tools for early design stages and data analysis, ensuring architectural grounding to manage hallucination risks.
Key insights
Theory-constrained retrieval and demographic anchoring enable LLMs to improve survey data imputation and analysis, especially for vulnerable groups.
Principles
- Organize retrieval around theoretical structures.
- Integrate all evidence in a single model call.
- Audit subgroup-stratified bias.
Method
A five-stage LLM integration framework covers questionnaire design, sample selection, pilot testing, missing-data imputation, and post-collection analysis, using a PMT-constrained knowledge graph.
In practice
- Use LLMs for instrument audit and gap identification.
- Forecast sample under-representation pre-fielding.
- Ground chatbot answers in knowledge graphs.
Topics
- Large Language Models
- Survey Research
- Missing Data Imputation
- Retrieval-Augmented Generation
- Disaster Preparedness
- Protection Motivation Theory
- Bias Auditing
Best for: AI Scientist, Research Scientist, Data Scientist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.