Learning neuroimaging models from health system-scale data
Summary
Prima, an AI foundation model for neuroimaging, was developed using over 220,000 MRI studies from a large academic health system to address rising demand and physician burnout in neurological disease evaluation. This model utilizes a hierarchical vision architecture to generate general and transferable MRI features, supporting real-world clinical MRI inputs. In a one-year health system-wide study involving 29,431 MRI scans, Prima achieved a mean diagnostic Area Under the Curve (AUC) of 92.0% across 52 radiologic diagnoses, outperforming other general and medical AI models. The system also provides explainable differential diagnoses, worklist prioritization for radiologists, and clinical referral recommendations, while demonstrating algorithmic fairness across diverse patient groups. The model parameters and code, implemented in Python with PyTorch and Transformers, are publicly available under an MIT license for investigational use.
Key takeaway
For AI scientists and neuroradiology teams developing diagnostic tools, Prima demonstrates that training on health system-scale MRI data significantly boosts diagnostic accuracy and clinical utility. Your efforts should focus on integrating similar large-scale, real-world datasets and hierarchical vision architectures to create robust, explainable, and fair AI models that can genuinely alleviate clinical burdens and improve patient outcomes.
Key insights
Health system-scale data enables AI foundation models to achieve superior, fair, and explainable neuroimaging diagnostics.
Principles
- Large-scale, real-world data improves AI diagnostic accuracy.
- Hierarchical vision architectures enhance MRI feature transferability.
- Algorithmic fairness is achievable across sensitive patient groups.
Method
Prima uses a hierarchical vision transformer trained with a CLIP objective on 3D MRI volumes and summarized radiology reports, followed by transfer learning with an MLP for diagnostic prediction.
In practice
- Utilize GPT-3.5-turbo for radiology report summarization.
- Implement VQ-VAE for MRI volume tokenization.
- Apply LIME for explainable AI predictions in clinical context.
Topics
- Neuroimaging AI
- AI Foundation Models
- Medical Diagnostics
- Algorithmic Fairness
- Health System Data
Code references
Best for: Computer Vision Engineer, AI Scientist, Research Scientist, AI Engineer, Machine Learning Engineer, AI Researcher
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Machine learning : nature.com subject feeds.