CVAILGROSep 24, 2025

Hyperspectral Adapter for Semantic Segmentation with Vision Foundation Models

arXiv:2509.20107v25 citationsh-index: 34Has CodeIEEE Robot Autom Lett
Originality Incremental advance
AI Analysis

This work addresses robust robotic perception in complex environments by improving semantic segmentation for hyperspectral imaging, representing an incremental advancement over existing methods.

The authors tackled the problem of underperforming hyperspectral semantic segmentation by proposing a hyperspectral adapter that leverages pretrained vision foundation models, achieving state-of-the-art performance on three autonomous driving datasets.

Hyperspectral imaging (HSI) captures spatial information along with dense spectral measurements across numerous narrow wavelength bands. This rich spectral content has the potential to facilitate robust robotic perception, particularly in environments with complex material compositions, varying illumination, or other visually challenging conditions. However, current HSI semantic segmentation methods underperform due to their reliance on architectures and learning frameworks optimized for RGB inputs. In this work, we propose a novel hyperspectral adapter that leverages pretrained vision foundation models to effectively learn from hyperspectral data. Our architecture incorporates a spectral transformer and a spectrum-aware spatial prior module to extract rich spatial-spectral features. Additionally, we introduce a modality-aware interaction block that facilitates effective integration of hyperspectral representations and frozen vision Transformer features through dedicated extraction and injection mechanisms. Extensive evaluations on three benchmark autonomous driving datasets demonstrate that our architecture achieves state-of-the-art semantic segmentation performance while directly using HSI inputs, outperforming both vision-based and hyperspectral segmentation methods. We make the code available at https://hsi-adapter.cs.uni-freiburg.de.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes