Predicting relations between SOAP note sections: The value of incorporating a clinical information model

· Source: Explosion · Developer tools and consulting for AI, Machine Learning and NLP - Explosion.ai · Field: Health & Wellbeing — Clinical Care & Medical Practice, Medical Devices & Health Technology, Health & Medical Research · Depth: Intermediate, quick

Summary

To support human annotation for predicting relations between SOAP note sections, a specific methodology was employed, focusing on "Assessment and Plan" subsections. Initially, 100 "Assessment and Plan" subsections were manually annotated using Prodigy. This initial manual effort provided the necessary data to fine-tune a spacy-transformers RoBERTa-base model. This model, originally pretrained on OntoNotes 5, was adapted for Named Entity Recognition (NER) tagging. The fine-tuning process specifically targeted both the "Assessment" and "Plan" sections, aiming to enhance the efficiency and accuracy of subsequent human annotation tasks by providing automated tagging capabilities.

Key takeaway

For NLP Engineers tasked with improving annotation efficiency for clinical notes, consider an iterative approach combining manual annotation with model fine-tuning. You should start with targeted manual annotation of a small dataset, like 100 SOAP note subsections, using tools such as Prodigy. Then, fine-tune a robust general-domain model, such as RoBERTa-base, on your specific clinical data to automate Named Entity Recognition for sections like "Assessment" and "Plan," significantly reducing subsequent manual effort.

Key insights

Fine-tuning a RoBERTa-base model on manually annotated clinical data improves NER for SOAP note sections.

Principles

Method

Manually annotate 100 "Assessment and Plan" subsections with Prodigy. Fine-tune a spacy-transformers RoBERTa-base model (pretrained on OntoNotes 5) for NER tagging on "Assessment" and "Plan" sections.

In practice

Topics

Best for: AI Scientist, Machine Learning Engineer, NLP Engineer, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Explosion · Developer tools and consulting for AI, Machine Learning and NLP - Explosion.ai.