Adaptive Querying with AI Persona Priors
Summary
This paper introduces "Adaptive Querying with AI Persona Priors," a novel framework for efficiently learning user-dependent quantities of interest, such as responses to unasked items or psychometric indicators, within strict question budgets. The method addresses limitations of classical Bayesian design and computerized adaptive testing (CAT), which often rely on restrictive parametric assumptions or computationally expensive posterior approximations. The core innovation is a persona-induced latent variable model where a user's state is represented by membership in a finite dictionary of AI personas. Each persona provides response distributions generated by a large language model (LLM) offline. This approach yields expressive priors with closed-form posterior updates and efficient finite-mixture predictions, enabling scalable Bayesian experimental design for sequential item selection. Experiments on synthetic data and the WorldValuesBench dataset demonstrate that persona-based posteriors provide accurate probabilistic predictions and an interpretable adaptive elicitation pipeline, outperforming CAT baselines, especially in cold-start scenarios.
Key takeaway
For Machine Learning Engineers developing interactive systems with tight query budgets, consider integrating AI persona priors. This approach offers a scalable and expressive alternative to traditional CAT, particularly beneficial in cold-start scenarios where historical data is scarce. Your team should leverage LLMs to pre-generate persona-specific response distributions, enabling efficient, closed-form Bayesian updates and robust adaptive querying. Be mindful that while greedy adaptive querying excels at small budgets, non-adaptive designs might offer more robustness against model misspecification at larger query budgets.
Key insights
AI personas from LLMs enable efficient, expressive Bayesian adaptive querying with closed-form posterior updates.
Principles
- LLM-generated personas can form expressive Bayesian priors.
- Closed-form posterior updates enhance scalability.
- Greedy adaptive querying balances efficiency and accuracy.
Method
The method involves generating persona-question response distributions offline via LLMs, then online, updating a prior over persona membership sequentially as user answers arrive, enabling efficient Bayesian experimental design.
In practice
- Use GPT-5-mini for persona-conditioned response elicitation.
- Employ the Twin-2K-500 persona bank for diverse profiles.
- Prioritize direct distribution elicitation over point predictions.
Topics
- Adaptive Querying
- AI Personas
- Large Language Models
- Bayesian Experimental Design
- Computerized Adaptive Testing
Code references
Best for: AI Scientist, Research Scientist, Machine Learning Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by stat.ML updates on arXiv.org.