Stitching and dimensionality effects on large artificially generated volume datasets
Summary
This study investigated stitching artifacts and dimensionality effects (2D vs 3D) in generating large images via deep learning, specifically using cycleGAN models trained on cryo-electron microscopy datasets. The research addressed how patching input data to accommodate hardware memory limitations and then assembling output patches can introduce artifacts. It evaluated both perceptual quality and performance on downstream mitochondria segmentation. Key findings include that FID scores fail to detect subtle stitching artifacts that significantly impact segmentation. While 3D models with artifact-free stitching marginally outperform 2D models on downstream tasks, the improvement barely justifies the computational cost. Additionally, 2D models train more stably due to larger batch sizes. The study also demonstrated that ensembling predictions from three orthogonal directions can improve low-quality volumes but offers no benefit for high-quality outputs, highlighting the need for careful artifact mitigation and the inadequacy of perceptual metrics for evaluating domain adaptation in biomedical imaging.
Key takeaway
For Machine Learning Engineers developing generative models for large scientific datasets, particularly in biomedical imaging, you must prioritize robust stitching artifact mitigation. Relying solely on perceptual metrics like FID scores is insufficient, as subtle artifacts can severely impact downstream segmentation performance. Consider 2D models for more stable training and lower computational overhead, reserving 3D models only when marginal performance gains definitively justify the increased cost. Additionally, explore ensembling predictions for improving low-quality volume outputs.
Key insights
Stitching artifacts in large image generation significantly degrade downstream segmentation, undetected by perceptual metrics like FID scores.
Principles
- Perceptual metrics alone are insufficient for biomedical image quality evaluation.
- 3D models offer marginal gains over 2D for high computational cost.
- 2D models provide more stable training with larger batch sizes.
Method
Investigated three stitching approaches and two patch dimensionalities (2D vs 3D) using cycleGAN models on cryo-electron microscopy datasets, evaluating perceptual quality and downstream segmentation.
In practice
- Mitigate stitching artifacts for generative model performance.
- Consider 2D models for stable training and lower cost.
- Ensemble predictions for low-quality volume improvement.
Topics
- Generative Models
- Stitching Artifacts
- CycleGAN
- Cryo-electron Microscopy
- Image Segmentation
- Perceptual Metrics
Best for: Computer Vision Engineer, AI Scientist, Machine Learning Engineer, Research Scientist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Computer Vision and Pattern Recognition.