MixerCA: An Efficient and Accurate Model for High-Performance Hyperspectral Image Classification
Summary
MixerCA is a novel, lightweight deep learning model designed for high-performance hyperspectral image (HSI) classification, integrating depthwise convolution, token and channel mixing, and coordinate attention. This architecture efficiently captures complex spatial and spectral features while maintaining consistent resolution and directly processing HSI patches. Extensive experiments on four benchmark datasets—Pavia University, Salinas, Gulfport of Mississippi, and Xuzhou—demonstrated MixerCA's superior performance compared to existing algorithms like 2D-CNN, 3D-CNN, Tri-CNN, HybridSN, ViT, and Swin Transformer. For instance, on the Pavia University dataset, MixerCA achieved an Overall Accuracy (OA) of 97.81%, outperforming HybridSN's 95.67%. The model also exhibits high computational efficiency, requiring 19,145,472 FLOPs and 9,318,144 MACs, significantly less than HybridSN's 97,483,008 FLOPs and 8,741,504 MACs.
Key takeaway
For AI Engineers and Research Scientists working on hyperspectral image classification, MixerCA offers a compelling balance of high accuracy and computational efficiency. You should consider adopting MixerCA, especially for applications with limited training data or resource constraints, as it consistently outperforms other models across various datasets and low training percentages. Evaluate its performance on your specific HSI tasks to leverage its robust feature extraction and reduced resource demands.
Key insights
MixerCA efficiently classifies hyperspectral images by integrating depthwise convolutions, token/channel mixing, and coordinate attention.
Principles
- Depthwise convolutions reduce computational complexity.
- Coordinate attention enhances feature discrimination.
- Multi-scale kernels capture diverse spatial details.
Method
MixerCA preprocesses HSI data with PCA, then uses multi-scale depthwise convolutions, token/channel mixing, and Coordinate Attention, followed by global average pooling and dense layers for classification.
In practice
- Use PCA for HSI dimensionality reduction.
- Optimize patch size and PCA count per dataset.
- Consider Coordinate Attention for HSI classification.
Topics
- Hyperspectral Image Classification
- MixerCA Model
- Depthwise Convolution
- Coordinate Attention
- Spatial-Spectral Feature Extraction
Code references
Best for: AI Engineer, Research Scientist, AI Scientist, Machine Learning Engineer, Computer Vision Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by cs.CV updates on arXiv.org.