Clinically-Informed Modeling for Pediatric Brain Tumor Classification from Whole-Slide Histopathology Images

· Source: cs.CV updates on arXiv.org · Field: Science & Research — Health & Medical Research, Mathematics & Computational Sciences · Depth: Expert, extended

Summary

A new expert-guided contrastive fine-tuning framework has been developed for pediatric brain tumor diagnosis from whole-slide histopathology images (WSIs). This approach addresses challenges like data scarcity, class imbalance, and fine-grained morphological overlap by integrating contrastive learning into slide-level multiple instance learning (MIL). The framework uses a frozen UNI2-h pathology foundation model for patch-level feature extraction and a multi-branch CLAM architecture for slide-level classification. It includes both a general supervised contrastive setting and an expert-guided variant that incorporates clinically informed hard negatives targeting diagnostically confusable subtypes. Experiments on a 237-WSI dataset from Dell Children's Medical Center, across 2-, 3-, 6-, and 7-class tasks, demonstrate that contrastive fine-tuning, particularly in multi-class settings, measurably improves fine-grained diagnostic distinctions and representation geometry, with optimal performance at intermediate contrastive loss weights.

Key takeaway

For Computer Vision Engineers developing diagnostic tools for pediatric pathology, integrating expert-guided contrastive learning into your WSI classification pipelines can significantly improve fine-grained tumor subtype differentiation, especially in data-scarce scenarios. You should experiment with intermediate contrastive loss weights to balance regularization and classification accuracy, and consider incorporating clinical knowledge to define hard negative pairs for better class separation.

Key insights

Contrastive learning, especially with expert-guided hard negatives, improves fine-grained pediatric brain tumor classification from WSIs.

Principles

Method

The method involves segmenting WSIs into patches, extracting 1536-dimensional features using a frozen UNI2-h model, and feeding these into a multi-branch CLAM architecture. A queue-based supervised contrastive loss, with or without expert-guided hard negatives, regularizes slide-level representations during MIL fine-tuning.

In practice

Topics

Best for: Computer Vision Engineer, AI Scientist, Machine Learning Engineer, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.CV updates on arXiv.org.