Explainable Flood Segmentation on Sentinel-1 SAR Imagery: A Comparative Study of CNN and Transformer Architectures

· Source: Computer Vision and Pattern Recognition · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Computer Vision & Pattern Recognition, Environmental Science & Earth Systems · Depth: Expert, quick

Summary

A comprehensive study compared convolutional neural network (CNN) and vision transformer architectures for multi-class flood segmentation using Sentinel-1 Synthetic Aperture Radar (SAR) imagery. The research focused on distinguishing flooded land from permanent water bodies and land, crucial for disaster response. Three CNN models (U-Net, U-Net++, DeepLabV3 with ResNet-34) and three SegFormer variants (b0, b1, b2) were evaluated on the ETCI NASA dataset and SenFloods11, employing scene-based data splits for realistic spatial generalization assessment. SegFormer-b2 significantly outperformed the U-Net baseline on the ETCI dataset, showing higher flood IoU across all 7 test scenes. While this advantage narrowed on Sen1Floods11 after fine-tuning, it remained notable for spatially fragmented flood events. The study also incorporated explainability techniques, revealing SegFormer-b2's more spatially coherent Grad-CAM activations and U-Net's superior uncertainty estimates along flood boundaries.

Key takeaway

For Machine Learning Engineers developing flood segmentation systems, this study suggests considering SegFormer-b2 as a primary architecture, especially for its strong performance on SAR imagery. If your application requires precise flood boundary uncertainty estimates, U-Net might be a more suitable choice. Always validate models using scene-based data splits to ensure robust spatial generalization in real-world disaster response scenarios.

Key insights

SegFormer-b2 excels in SAR flood segmentation, outperforming CNNs on specific datasets, with distinct explainability characteristics.

Principles

Method

Compared U-Net, U-Net++, DeepLabV3, and SegFormer variants for multi-class flood segmentation on Sentinel-1 SAR imagery. Evaluated on ETCI and SenFloods11 datasets using scene-based splits, incorporating qualitative and quantitative explainability.

In practice

Topics

Best for: Research Scientist, AI Scientist, Machine Learning Engineer, Computer Vision Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Computer Vision and Pattern Recognition.