DDF2Pol: A Dual-Domain Feature Fusion Network for PolSAR Image Classification
Summary
DDF2Pol is a novel, lightweight dual-domain convolutional neural network designed for PolSAR image classification. This architecture features two parallel streams: one processes real-valued feature descriptors, and the other handles complex-valued PolSAR data, both utilizing 3D convolutional layers to capture spatial and polarimetric information. The model further refines features using a depth-wise convolution layer for spatial enhancement and a coordinate attention mechanism to focus on informative regions. Evaluated on the Flevoland and San Francisco benchmark datasets, DDF2Pol achieved an Overall Accuracy (OA) of 98.16% and 96.12% respectively, outperforming several existing real- and complex-valued models. With only 91,371 parameters, DDF2Pol offers an efficient solution, demonstrating strong performance even with limited training data, such as 0.25% of labeled samples.
Key takeaway
For Computer Vision Engineers developing PolSAR classification systems, DDF2Pol demonstrates that combining real and complex-valued feature streams with efficient spatial refinement and attention mechanisms can yield superior accuracy with significantly reduced model complexity. You should consider adopting a dual-domain approach and incorporating depth-wise convolutions and coordinate attention, especially when working with limited labeled data, to achieve robust and scalable classification performance.
Key insights
DDF2Pol fuses real and complex PolSAR features with attention for efficient, high-accuracy classification.
Principles
- Dual-domain processing captures complementary PolSAR features.
- Depth-wise convolution refines spatial features efficiently.
- Coordinate attention enhances feature discrimination.
Method
DDF2Pol employs parallel 3D CNN streams for real and complex PolSAR data, followed by depth-wise convolution for spatial refinement and a coordinate attention block to emphasize relevant features before global average pooling for classification.
In practice
- Use 3D convolutions for efficient PolSAR feature extraction.
- Integrate depth-wise convolution to reduce parameter count.
- Apply coordinate attention for improved accuracy with low overhead.
Topics
- PolSAR Image Classification
- Dual-Domain CNN
- Complex-Valued CNNs
- Depthwise Convolution
- Coordinate Attention
Code references
Best for: Computer Vision Engineer, AI Scientist, Machine Learning Engineer, Research Scientist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by cs.CV updates on arXiv.org.