A Proactive EMR Assistant for Doctor-Patient Dialogue: Streaming ASR, Belief Stabilization, and Preliminary Controlled Evaluation

· Source: cs.AI updates on arXiv.org · Field: Health & Wellbeing — Medical Devices & Health Technology, Clinical Care & Medical Practice · Depth: Expert, long

Summary

A new end-to-end proactive Electronic Medical Record (EMR) assistant has been developed to provide real-time consultation support during doctor-patient dialogues, moving beyond passive post-consultation note generation. This system integrates streaming Automatic Speech Recognition (ASR), punctuation restoration, stateful information extraction, belief stabilization, objectified retrieval, action planning, and replayable report generation. Evaluated in a preliminary controlled setting using ten streamed doctor-patient dialogues and a 300-query retrieval benchmark, the full system achieved a state-event F1 of 0.84, retrieval Recall@5 of 0.87, and end-to-end pilot scores of 83.3% coverage, 81.4% structural completeness, and 80.0% risk recall. Ablation studies suggest that both punctuation restoration and belief stabilization significantly improve downstream extraction, retrieval, and action selection within this pilot, demonstrating the technical coherence and directional support of the online architecture under tightly controlled conditions.

Key takeaway

For NLP engineers developing real-time clinical documentation systems, you should prioritize integrating streaming ASR with robust punctuation restoration and belief stabilization. This architecture, demonstrated to improve extraction, retrieval, and action selection in pilot studies, offers a concrete baseline for building proactive EMR assistants. Your next steps should involve validating this pipeline on larger, more diverse prospective clinical data to assess real-world performance and clinical utility.

Key insights

Proactive EMR assistants require streaming ASR, belief stabilization, and objectified retrieval for real-time consultation support.

Principles

Method

The system uses streaming ASR, punctuation restoration, stateful extraction, belief stabilization, hybrid retrieval, and POMDP-lite action planning to maintain a dynamic state and suggest next actions.

In practice

Topics

Best for: NLP Engineer, AI Scientist, Machine Learning Engineer, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.