CVFeb 24, 2025

SpecDM: Hyperspectral Dataset Synthesis with Pixel-level Semantic Annotations

arXiv:2502.17056v11 citationsh-index: 26
Originality Incremental advance
AI Analysis

This addresses the problem of data scarcity for researchers in hyperspectral remote sensing, though it is incremental as it applies existing generative methods to a new domain.

The paper tackles the high cost of acquiring annotated hyperspectral images for dense prediction tasks by proposing SpecDM, a generative diffusion model that synthesizes hyperspectral images with pixel-level semantic annotations, and experiments show it improves performance on semantic segmentation and change detection tasks.

In hyperspectral remote sensing field, some downstream dense prediction tasks, such as semantic segmentation (SS) and change detection (CD), rely on supervised learning to improve model performance and require a large amount of manually annotated data for training. However, due to the needs of specific equipment and special application scenarios, the acquisition and annotation of hyperspectral images (HSIs) are often costly and time-consuming. To this end, our work explores the potential of generative diffusion model in synthesizing HSIs with pixel-level annotations. The main idea is to utilize a two-stream VAE to learn the latent representations of images and corresponding masks respectively, learn their joint distribution during the diffusion model training, and finally obtain the image and mask through their respective decoders. To the best of our knowledge, it is the first work to generate high-dimensional HSIs with annotations. Our proposed approach can be applied in various kinds of dataset generation. We select two of the most widely used dense prediction tasks: semantic segmentation and change detection, and generate datasets suitable for these tasks. Experiments demonstrate that our synthetic datasets have a positive impact on the improvement of these downstream tasks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes