Bootstrapping-based Regularisation for Reducing Individual Prediction Instability in Clinical Risk Prediction Models

· Source: stat.ML updates on arXiv.org · Field: Health & Wellbeing — Health & Medical Research, Clinical Care & Medical Practice, Medical Devices & Health Technology · Depth: Advanced, extended

Summary

A novel bootstrapping-based regularisation framework has been developed to enhance the individual prediction stability of deep learning clinical risk models. This method embeds the bootstrapping process directly into the training of deep neural networks, constraining prediction variability across resampled datasets to produce a single, stable model. Evaluated against conventional and ensemble models using simulated data and three clinical datasets (GUSTO-I, Framingham, SUPPORT), the proposed approach consistently demonstrated improved prediction stability with lower mean absolute differences (e.g., 0.019 vs. 0.059 in GUSTO-I; 0.057 vs. 0.088 in Framingham) and significantly fewer deviating predictions. Crucially, it maintained discriminative performance (AUC) and feature importance consistency (high SHAP correlations, e.g., 0.894 for GUSTO-I), addressing the interpretability issues often associated with ensemble methods.

Key takeaway

Research scientists developing clinical prediction models should consider implementing this bootstrapping-based regularisation framework. This approach offers a practical route to achieve greater robustness and reproducibility in deep learning models, particularly in data-limited healthcare settings, without compromising the interpretability essential for clinical trust and adoption. You can achieve ensemble-like stability within a single, explainable model.

Key insights

A novel regularisation method improves clinical prediction model stability without sacrificing interpretability, crucial for healthcare adoption.

Principles

Method

The method introduces a penalised likelihood function that includes a regularisation term measuring the expected difference in predictions between the target model and pre-computed bootstrapped models, balancing data fit with diversity.

In practice

Topics

Best for: Research Scientist, AI Researcher, AI Scientist, AI Data Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by stat.ML updates on arXiv.org.