CVMar 31, 2024

DMSSN: Distilled Mixed Spectral-Spatial Network for Hyperspectral Salient Object Detection

arXiv:2404.00694v123 citationsh-index: 21Has CodeIEEE Trans Geosci Remote Sens
Originality Incremental advance
AI Analysis

This work addresses hyperspectral salient object detection for applications in complex scenarios where RGB methods fail, representing an incremental improvement with a new dataset.

The paper tackles the loss of spectral information and insufficient feature extraction in hyperspectral salient object detection by proposing DMSSN, which uses knowledge distillation for dimension reduction and a transformer for feature extraction, achieving state-of-the-art performance on multiple datasets.

Hyperspectral salient object detection (HSOD) has exhibited remarkable promise across various applications, particularly in intricate scenarios where conventional RGB-based approaches fall short. Despite the considerable progress in HSOD method advancements, two critical challenges require immediate attention. Firstly, existing hyperspectral data dimension reduction techniques incur a loss of spectral information, which adversely affects detection accuracy. Secondly, previous methods insufficiently harness the inherent distinctive attributes of hyperspectral images (HSIs) during the feature extraction process. To address these challenges, we propose a novel approach termed the Distilled Mixed Spectral-Spatial Network (DMSSN), comprising a Distilled Spectral Encoding process and a Mixed Spectral-Spatial Transformer (MSST) feature extraction network. The encoding process utilizes knowledge distillation to construct a lightweight autoencoder for dimension reduction, striking a balance between robust encoding capabilities and low computational costs. The MSST extracts spectral-spatial features through multiple attention head groups, collaboratively enhancing its resistance to intricate scenarios. Moreover, we have created a large-scale HSOD dataset, HSOD-BIT, to tackle the issue of data scarcity in this field and meet the fundamental data requirements of deep network training. Extensive experiments demonstrate that our proposed DMSSN achieves state-of-the-art performance on multiple datasets. We will soon make the code and dataset publicly available on https://github.com/anonymous0519/HSOD-BIT.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes