Using Cognitive Models to Improve Language Model Simulation of Human Persuasion Games

· Source: Artificial Intelligence · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Social Sciences & Behavioral Studies · Depth: Expert, quick

Summary

A new approach, "Equation-to-Behavior Prompting," guides large language models to simulate diverse human decision-making in strategic interactions, addressing the common failure of current LLM simulations to cover the full spectrum of human behavior, from Bayesian updating to motivated reasoning. This method leverages mathematical models from cognitive science and economics. Researchers found that large models can approximate complex equation-based specifications, including Bayesian updating, affine distortion, motivated updating, and Grether's α-β model, through prompting. Small models, however, struggled with this approach. To overcome this, the study introduces "Equation-to-Behavior RL," a reinforcement learning method that reduces belief error by 26.5% in out-of-distribution parameterizations for small models. These enhanced simulations create diverse training environments, improving average belief change by 2.5%-12% over Bayesian-only training when persuading models like GPT-5-mini.

Key takeaway

For AI Scientists developing human-like simulations, you should integrate cognitive models to capture the full spectrum of human decision-making beyond simple Bayesian updating. This approach, using "Equation-to-Behavior Prompting" or "Equation-to-Behavior RL" for smaller models, significantly enhances the diversity and realism of simulated agents. Your training environments will become more robust, leading to improved model performance in complex strategic interactions and better safety evaluations.

Key insights

Cognitive models and RL can guide LLMs to simulate diverse human decision-making, improving training environments.

Principles

Method

"Equation-to-Behavior Prompting" uses mathematical models to guide LLMs. For small models, "Equation-to-Behavior RL" applies reinforcement learning to adhere to these rules.

In practice

Topics

Best for: AI Scientist, Machine Learning Engineer, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.