HyperFM: An Efficient Hyperspectral Foundation Model with Spectral Grouping

2024-12-23 · Source: cs.CV updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics, Environmental Science & Earth Systems · Depth: Expert, extended

Summary

HyperFM is a new parameter-efficient hyperspectral foundation model designed to address the challenges of analyzing large, complex hyperspectral data, particularly from missions like NASA PACE. Existing foundation models, typically trained on RGB imagery or cloud-free hyperspectral data, struggle with the continuous spectral signatures and cloudy scenes crucial for atmospheric science. HyperFM leverages intra-group and inter-group spectral attention via a "Group Embed" module and employs "Hypoformer" blocks with hybrid parameter decomposition to efficiently capture spectral-spatial relationships. The model demonstrates consistent performance improvements, averaging a 32.36% lower Mean Squared Error (MSE) than the best baseline, HyperSigma, across four atmospheric cloud property retrieval tasks: Cloud Optical Thickness (COT), Cloud Effective Radius (CER), Cloud Water Path (CWP), and Cloud Top Height (CTH). To support this, the researchers also released HyperFM250K, a large-scale hyperspectral dataset from the PACE mission that includes both clear and cloudy scenes, comprising 253,104 tiles totaling over 3 TB.

Key takeaway

For AI Scientists and Machine Learning Engineers developing models for atmospheric science, HyperFM offers a robust, parameter-efficient solution for hyperspectral data analysis. Its superior performance on cloud property retrieval tasks, enabled by specialized spectral grouping and hybrid tensor-train decomposition, suggests that focusing on domain-specific architectural innovations and diverse training data (like HyperFM250K) is critical. Consider adopting HyperFM's architectural principles to improve accuracy and computational efficiency in your hyperspectral applications, especially those involving complex atmospheric conditions.

Key insights

HyperFM is a parameter-efficient hyperspectral foundation model outperforming baselines on cloud property retrieval tasks.

Principles

Spectral grouping improves hyperspectral feature extraction.
Hybrid tensor-train decomposition reduces model parameters.
Pretraining on cloudy data is crucial for atmospheric tasks.

Method

HyperFM uses a "Group Embed" module for group-wise spectral-spatial feature extraction, followed by "Hypoformer" blocks with Hybrid Tensor Train (HTT) Attention and Low Matrix Factorization (LMF) FFN for parameter efficiency. It is pretrained via Masked Autoencoder (MAE).

In practice

Use HyperFM250K for cloud-focused hyperspectral model training.
Implement spectral grouping to handle high-dimensional HSI data.
Apply hybrid tensor-train decomposition for efficient model scaling.

Topics

HyperFM
HyperFM250K Dataset
Hyperspectral Foundation Models
Cloud Property Retrieval
NASA PACE OCI

Code references

umbc-sanjaylab/HyperFM

Best for: AI Scientist, Machine Learning Engineer, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.CV updates on arXiv.org.