Building Extraction from Remote Sensing Imagery under Hazy and Low-light Conditions: Benchmark and Baseline

· Source: Computer Vision and Pattern Recognition · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Remote Sensing · Depth: Expert, quick

Summary

HaLoBuilding is introduced as the first optical benchmark dataset specifically designed for building extraction from remote sensing (RS) imagery under challenging hazy and low-light conditions. Existing methods and benchmarks primarily focus on ideal clear-weather scenarios, leading to performance degradation in real-world applications. This new benchmark utilizes a same-scene multitemporal pairing strategy to ensure pixel-level label alignment and high fidelity, even under extreme atmospheric degradation. Alongside the dataset, the authors propose HaLoBuild-Net, an end-to-end framework for building extraction in adverse RS environments. HaLoBuild-Net incorporates a Spatial-Frequency Focus Module (SFFM) to mitigate meteorological interference, a Global Multi-scale Guidance Module (GMGM) for global semantic constraints, and a Mutual-Guided Fusion Module (MGFM) for semantic-spatial calibration. Experiments show HaLoBuild-Net significantly outperforms other methods on HaLoBuilding and generalizes well to WHU, INRIA, and LoveDA datasets.

Key takeaway

For computer vision engineers developing building extraction models, the HaLoBuilding dataset and HaLoBuild-Net offer a critical advancement for real-world applications. Your current models likely underperform in hazy or low-light conditions; adopting this benchmark and framework can significantly improve robustness and accuracy in adverse weather scenarios. Consider integrating the proposed modules to enhance feature extraction and semantic consistency in your next-generation models.

Key insights

A new benchmark and network address building extraction challenges in hazy and low-light remote sensing imagery.

Principles

Method

HaLoBuild-Net uses SFFM for meteorological interference mitigation, GMGM for global semantic constraints, and MGFM for bidirectional semantic-spatial calibration to extract buildings from degraded RS imagery.

In practice

Topics

Code references

Best for: AI Scientist, Computer Vision Engineer, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Computer Vision and Pattern Recognition.