Meituan Merchant Business Diagnosis via Policy-Guided Dual-Process User Simulation
Summary
Meituan has developed Policy-Guided Hybrid Simulation (PGHS), a dual-process framework designed to simulate group-level user behavior for evaluating merchant strategies without expensive online experiments. PGHS addresses two key challenges in user simulation: information incompleteness, which leads reasoning-based simulators to over-rationalize, and mechanism duality, which requires capturing both explicit preferences and implicit statistical regularities. The framework mines transferable decision policies from behavioral trajectories, using them as a shared alignment layer. This layer anchors an LLM-based reasoning branch to prevent over-rationalization and an ML-based fitting branch to absorb implicit regularities. The group-level predictions from both branches are fused for complementary correction. Deployed on Meituan with 101 merchants and over 26,000 trajectories, PGHS achieved a group simulation error of 8.80%, outperforming the best reasoning-based and fitting-based baselines by 45.8% and 40.9% respectively.
Key takeaway
For research scientists developing user behavior simulators, PGHS offers a robust framework to overcome limitations of purely reasoning-based or fitting-based approaches. You should consider integrating policy-guided dual-process simulation to achieve higher accuracy in counterfactual evaluations, especially when dealing with complex user behaviors influenced by both explicit preferences and implicit factors.
Key insights
A dual-process simulation framework improves user behavior prediction by combining LLM reasoning with ML fitting.
Principles
- Combine reasoning and fitting for robust simulation.
- Policy guidance aligns diverse simulation models.
Method
PGHS mines decision policies from trajectories, using them to align an LLM-based reasoning branch and an ML-based fitting branch, then fuses their group-level predictions.
In practice
- Use policy guidance to anchor LLM and ML models.
- Fuse diverse model outputs for error correction.
Topics
- Policy-Guided Hybrid Simulation
- User Behavior Simulation
- Merchant Business Diagnosis
- LLM-based Reasoning
- Machine Learning Models
Best for: Research Scientist, AI Scientist, Machine Learning Engineer, Data Scientist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Computation and Language.