Multi-Stage Bi-Atrial Segmentation Framework from 3D Late Gadolinium-Enhanced MRI using V-Net Family Models

· Source: cs.CV updates on arXiv.org · Field: Science & Research — Health & Medical Research, Engineering & Applied Sciences, Mathematics & Computational Sciences · Depth: Advanced, medium

Summary

A multi-stage framework has been developed for multi-class bi-atrial segmentation from 3D late gadolinium-enhanced (LGE) MRI of the human heart, addressing a challenge held as part of MICCAI 2024. The pipeline incorporates a preprocessing step using multidimensional contrast limited adaptive histogram equalization (MCLAHE) to enhance raw MRI scans, followed by a two-stage segmentation process. The first stage performs coarse region detection of the atrium using a V-Net family model on down-sampled data, while the second stage executes fine multi-class segmentation (left atrium, right atrium, wall, and background) on a cropped region using another V-Net model. Asymmetric loss is applied for model optimization, with fixed hyperparameters γ₊=1, γ₋=4, and m=0.05. Experiments on a public validation set of 30 3D LGE-MRI scans showed that a vanilla V-Net model with MCLAHE preprocessing achieved the best performance, outperforming a more complex V-Net++ model in terms of Dice and HD95 metrics.

Key takeaway

For Computer Vision Engineers developing medical image segmentation models, you should prioritize robust preprocessing techniques like MCLAHE, as it significantly improves model performance and transferability, even with simpler architectures. Do not solely rely on training loss curves; validate models on diverse datasets to assess their real-world effectiveness, especially for imbalanced tasks like bi-atrial segmentation.

Key insights

A multi-stage V-Net framework with MCLAHE preprocessing effectively segments bi-atrial regions from 3D LGE-MRI.

Principles

Method

The method involves MCLAHE preprocessing, coarse atrium region detection using a V-Net on down-sampled data, and fine multi-class segmentation on a cropped region with another V-Net, optimized by asymmetric loss.

In practice

Topics

Code references

Best for: Computer Vision Engineer, AI Scientist, Machine Learning Engineer, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.CV updates on arXiv.org.