A Proactive EMR Assistant for Doctor-Patient Dialogue: Streaming ASR, Belief Stabilization, and Preliminary Controlled Evaluation
Summary
A new end-to-end proactive Electronic Medical Record (EMR) assistant has been developed to provide real-time consultation support during doctor-patient dialogues, moving beyond passive post-consultation note generation. This system integrates streaming Automatic Speech Recognition (ASR), punctuation restoration, stateful information extraction, belief stabilization, objectified retrieval, action planning, and replayable report generation. Evaluated in a preliminary controlled setting using ten streamed doctor-patient dialogues and a 300-query retrieval benchmark, the full system achieved a state-event F1 of 0.84, retrieval Recall@5 of 0.87, and end-to-end pilot scores of 83.3% coverage, 81.4% structural completeness, and 80.0% risk recall. Ablation studies suggest that both punctuation restoration and belief stabilization significantly improve downstream extraction, retrieval, and action selection within this pilot, demonstrating the technical coherence and directional support of the online architecture under tightly controlled conditions.
Key takeaway
For NLP engineers developing real-time clinical documentation systems, you should prioritize integrating streaming ASR with robust punctuation restoration and belief stabilization. This architecture, demonstrated to improve extraction, retrieval, and action selection in pilot studies, offers a concrete baseline for building proactive EMR assistants. Your next steps should involve validating this pipeline on larger, more diverse prospective clinical data to assess real-world performance and clinical utility.
Key insights
Proactive EMR assistants require streaming ASR, belief stabilization, and objectified retrieval for real-time consultation support.
Principles
- Punctuation is critical for downstream NLP tasks.
- Stabilize raw LM outputs for reliable clinical decisions.
- Objectify content for enhanced retrieval quality.
Method
The system uses streaming ASR, punctuation restoration, stateful extraction, belief stabilization, hybrid retrieval, and POMDP-lite action planning to maintain a dynamic state and suggest next actions.
In practice
- Implement punctuation restoration after ASR.
- Apply temperature scaling and exponential smoothing to belief states.
- Anchor medical objects for traceability and replay.
Topics
- Proactive EMR Assistant
- Streaming ASR
- Punctuation Restoration
- Belief Stabilization
- Hybrid Retrieval
Best for: NLP Engineer, AI Scientist, Machine Learning Engineer, Research Scientist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.