H-AdminSim: A Multi-Agent Simulator for Realistic Hospital Administrative Workflows with FHIR Integration
Summary
H-AdminSim is a novel, comprehensive multi-agent simulation framework designed to evaluate Large Language Model (LLM)-based automation in hospital administrative workflows, specifically focusing on first-visit outpatient scenarios. It integrates realistic synthetic data generation, multi-agent dialogue simulation for patient intake and appointment scheduling, and Fast Healthcare Interoperability Resources (FHIR) for unified data exchange across heterogeneous hospital settings. The framework simulates hospitals of varying scales (primary, secondary, tertiary) with diverse patient backgrounds, including 194 disease-symptom pairs across nine internal medicine specialties. Quantitative evaluation uses detailed rubrics for tasks like department assignment and appointment scheduling, including rescheduling and cancellation. Experiments with GPT-5 Mini, GPT-5 Nano, and Gemini 2.5 Flash demonstrate that while tool-based scheduling is stable, patient intake, particularly department assignment accuracy, remains a significant bottleneck for LLM performance.
Key takeaway
For AI Scientists and Machine Learning Engineers developing LLM-based healthcare solutions, you should prioritize robust tool-based scheduling pipelines for appointment management, as pure LLM reasoning is significantly less reliable. Focus your research and development efforts on enhancing LLM performance in patient intake and department assignment, as this remains the primary challenge. Consider integrating FHIR standards early in your simulation and deployment strategies to ensure interoperability and scalability across diverse hospital environments.
Key insights
H-AdminSim provides a FHIR-integrated multi-agent simulation for evaluating LLM automation in complex hospital administrative workflows.
Principles
- Realistic simulation requires diverse patient profiles and hospital scales.
- Tool-based scheduling outperforms pure LLM reasoning.
- Patient intake is a primary bottleneck for LLM-driven administration.
Method
H-AdminSim uses hierarchical synthetic data generation, multi-agent dialogue for patient intake and scheduling, and FHIR integration for real-time hospital information system updates, evaluated via detailed rubrics.
In practice
- Use tool-calling for LLM-based scheduling to ensure stability.
- Prioritize improving LLM performance in patient intake dialogues.
- Leverage FHIR for interoperable healthcare data simulation.
Topics
- Multi-Agent Simulation
- Hospital Administrative Workflows
- FHIR Integration
- LLM Evaluation
- Patient Intake
Best for: AI Scientist, Machine Learning Engineer, Research Scientist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.