CVApr 2

Cosine-Normalized Attention for Hyperspectral Image Classification

arXiv:2604.0176318.4h-index: 1
AI Analysis

This work addresses a specific bottleneck in hyperspectral image classification for remote sensing applications, offering an incremental improvement in attention mechanisms.

The paper tackles the suboptimality of dot-product attention in hyperspectral image classification by introducing cosine-normalized attention, which emphasizes angular relationships and reduces sensitivity to magnitude variations, resulting in consistent performance gains on three benchmark datasets, outperforming recent Transformer- and Mamba-based models with a lightweight backbone.

Transformer-based methods have improved hyperspectral image classification (HSIC) by modeling long-range spatial-spectral dependencies; however, their attention mechanisms typically rely on dot-product similarity, which mixes feature magnitude and orientation and may be suboptimal for hyperspectral data. This work revisits attention scoring from a geometric perspective and introduces a cosine-normalized attention formulation that aligns similarity computation with the angular structure of hyperspectral signatures. By projecting query and key embeddings onto a unit hypersphere and applying a squared cosine similarity, the proposed method emphasizes angular relationships while reducing sensitivity to magnitude variations. The formulation is integrated into a spatial-spectral Transformer and evaluated under extremely limited supervision. Experiments on three benchmark datasets demonstrate that the proposed approach consistently achieves higher performance, outperforming several recent Transformer- and Mamba-based models despite using a lightweight backbone. In addition, a controlled analysis of multiple attention score functions shows that cosine-based scoring provides a reliable inductive bias for hyperspectral representation learning.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes