Implementing trust in non-small cell lung cancer diagnosis with a conformalized uncertainty-aware AI framework

· Source: Machine learning : nature.com subject feeds · Field: Health & Wellbeing — Medical Devices & Health Technology, Health & Medical Research, Clinical Care & Medical Practice · Depth: Expert, extended

Summary

TRUECAM (Trustworthiness-focused, Uncertainty-aware, End-to-end Cancer diagnosis with Model-agnostic capabilities) is a novel AI framework for non-small cell lung cancer (NSCLC) subtyping using whole-slide images. It integrates a spectral-normalized neural Gaussian process (SNGP) for out-of-scope input identification, ambiguity-guided tile elimination (EAT) to filter ambiguous regions, and conformal prediction (CP) for controlled error rates. Evaluated across multiple cancer datasets, including over 20,000 WSIs, and with both task-specific (Inception-v3) and foundation models (UNI, CONCH, Prov-GigaPath, TITAN), TRUECAM consistently improved classification accuracy, robustness, interpretability, data efficiency, and fairness. For instance, Inception-v3 with TRUECAM reduced error rates by 72.0% (1-α=0.95) and 93.8% (1-α=0.99). It also reduced racial accuracy gaps by 38.1% (TCGA) and 78.3% (CPTAC).

Key takeaway

For AI Scientists developing diagnostic tools, integrating TRUECAM into your pathology AI pipeline is crucial for deploying trustworthy systems. This framework allows you to achieve statistically guaranteed error rates, robustly handle out-of-domain data, and enhance fairness across demographic groups, significantly reducing misdiagnosis risks. Consider adopting its SNGP, EAT, and CP components to improve model reliability and interpretability in real-world clinical settings.

Key insights

TRUECAM enhances pathology AI trustworthiness by integrating uncertainty quantification, data filtering, and error rate control.

Principles

Method

TRUECAM combines SNGP for OOD detection and uncertainty estimation, ambiguity-guided tile elimination (EAT) to filter noisy regions, and conformal prediction (CP) with conformal risk control (CRC) for statistically guaranteed error rates.

In practice

Topics

Code references

Best for: AI Scientist, Research Scientist, AI Ethicist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Machine learning : nature.com subject feeds.