EMSDialog: Synthetic Multi-person Emergency Medical Service Dialogue Generation from Electronic Patient Care Reports via Multi-LLM Agents
Summary
EMSDialog introduces a novel dataset of 4,414 synthetic multi-speaker Emergency Medical Service (EMS) conversations, generated from real-world Electronic Patient Care Reports (ePCRs). This dataset addresses the scarcity of multi-party medical dialogue corpora needed for conversational diagnosis prediction, which requires models to track evolving evidence in streaming clinical conversations. The generation pipeline utilizes multi-LLM agents, grounded in ePCRs and topic flow, to iteratively plan, generate, and self-refine dialogues with rule-based factual and topic flow checks. EMSDialog is annotated with 43 diagnoses, speaker roles, and turn-level topics. Human and LLM evaluations confirm the high quality and realism of the dataset using both utterance- and conversation-level metrics. Training with EMSDialog significantly improves the accuracy, timeliness, and stability of EMS conversational diagnosis prediction models.
Key takeaway
For research scientists developing conversational AI for medical diagnosis, EMSDialog offers a critical resource to overcome limitations of existing dyadic datasets. You should consider integrating this synthetic multi-speaker dataset into your training regimens to improve model accuracy, timeliness, and stability in tracking evolving evidence within complex clinical conversations. This approach can lead to more robust and reliable diagnostic prediction systems.
Key insights
Synthetic multi-speaker EMS dialogues generated from ePCRs enhance conversational diagnosis prediction accuracy and stability.
Principles
- Multi-agent LLM pipelines can generate high-quality synthetic data.
- ePCRs provide a robust grounding for medical dialogue synthesis.
- Topic flow and factual checks are crucial for dialogue realism.
Method
An ePCR-grounded, topic-flow-based multi-agent generation pipeline iteratively plans, generates, and self-refines dialogues using rule-based factual and topic flow checks to create synthetic multi-speaker EMS conversations.
In practice
- Use multi-LLM agents for complex data generation.
- Ground synthetic data generation in real-world reports.
- Incorporate rule-based checks for factual accuracy.
Topics
- EMSDialog Dataset
- Multi-LLM Agents
- Conversational Diagnosis Prediction
- Electronic Patient Care Reports
- Multi-speaker Dialogue Generation
Best for: Research Scientist, AI Scientist, Machine Learning Engineer, NLP Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by cs.CL updates on arXiv.org.