EAGT: Echocardiography Augmentation for Generalisability and Transferability

· Source: cs.CV updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Health & Medical Research, Data Science & Analytics · Depth: Expert, extended

Summary

This study systematically evaluates 29 data augmentation techniques and their pairwise combinations to improve the cross-dataset generalizability of deep learning models for 2D left ventricular (LV) segmentation in echocardiography. Using a U-Net architecture, models were trained on Unity, CAMUS, and EchoNet Dynamic datasets and evaluated in-domain and across datasets using Dice and IoU scores. The research found that anatomically plausible geometric transformations, such as affine, shift-scale-rotate, perspective, and random horizontal flip, significantly enhance cross-dataset performance. Conversely, aggressive intensity- or artifact-based augmentations often degrade generalizability. Pairwise combinations, particularly moderate flip-centric ones like random horizontal flip with affine, consistently yielded greater gains than individual augmentations, providing empirically grounded guidance for designing robust augmentation policies for echocardiography segmentation models.

Key takeaway

For Computer Vision Engineers developing echocardiography segmentation models, you should prioritize geometric data augmentations like affine, shift-scale-rotate, perspective, and random horizontal flip to enhance cross-dataset generalizability. Combining random horizontal flip with affine transformations offers particularly robust performance improvements, especially when transferring models between different scanner vendors or acquisition protocols. Avoid overly aggressive intensity-based augmentations, as they can degrade performance in cross-dataset scenarios.

Key insights

Geometric data augmentations significantly improve echocardiography model generalizability across diverse datasets, outperforming intensity-based methods.

Principles

Method

A U-Net model was trained on three echocardiography datasets, evaluating 29 individual and selected pairwise augmentation techniques with multiple hyperparameter settings, using Dice and IoU metrics and t-tests for statistical significance.

In practice

Topics

Code references

Best for: Computer Vision Engineer, AI Scientist, Research Scientist, Machine Learning Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.CV updates on arXiv.org.