MixerCA: An Efficient and Accurate Model for High-Performance Hyperspectral Image Classification

· Source: cs.CV updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Remote Sensing · Depth: Expert, extended

Summary

MixerCA is a novel, lightweight deep learning model designed for high-performance hyperspectral image (HSI) classification, integrating depthwise convolution, token and channel mixing, and coordinate attention. This architecture efficiently captures complex spatial and spectral features while maintaining consistent resolution and directly processing HSI patches. Extensive experiments on four benchmark datasets—Pavia University, Salinas, Gulfport of Mississippi, and Xuzhou—demonstrated MixerCA's superior performance compared to existing algorithms like 2D-CNN, 3D-CNN, Tri-CNN, HybridSN, ViT, and Swin Transformer. For instance, on the Pavia University dataset, MixerCA achieved an Overall Accuracy (OA) of 97.81%, outperforming HybridSN's 95.67%. The model also exhibits high computational efficiency, requiring 19,145,472 FLOPs and 9,318,144 MACs, significantly less than HybridSN's 97,483,008 FLOPs and 8,741,504 MACs.

Key takeaway

For AI Engineers and Research Scientists working on hyperspectral image classification, MixerCA offers a compelling balance of high accuracy and computational efficiency. You should consider adopting MixerCA, especially for applications with limited training data or resource constraints, as it consistently outperforms other models across various datasets and low training percentages. Evaluate its performance on your specific HSI tasks to leverage its robust feature extraction and reduced resource demands.

Key insights

MixerCA efficiently classifies hyperspectral images by integrating depthwise convolutions, token/channel mixing, and coordinate attention.

Principles

Method

MixerCA preprocesses HSI data with PCA, then uses multi-scale depthwise convolutions, token/channel mixing, and Coordinate Attention, followed by global average pooling and dense layers for classification.

In practice

Topics

Code references

Best for: AI Engineer, Research Scientist, AI Scientist, Machine Learning Engineer, Computer Vision Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.CV updates on arXiv.org.