CVJan 22

Towards Realistic Remote Sensing Dataset Distillation with Discriminative Prototype-guided Diffusion

arXiv:2601.15829v1h-index: 74Has Code
Originality Incremental advance
AI Analysis

It addresses data efficiency and privacy issues for remote sensing applications, but is incremental as it adapts existing dataset distillation methods to this domain.

This study tackles the challenges of high storage costs and data leakage in remote sensing image interpretation by introducing dataset distillation, using a diffusion model to condense large datasets into compact, representative ones, achieving realistic and diverse samples for downstream training as shown on three benchmarks.

Recent years have witnessed the remarkable success of deep learning in remote sensing image interpretation, driven by the availability of large-scale benchmark datasets. However, this reliance on massive training data also brings two major challenges: (1) high storage and computational costs, and (2) the risk of data leakage, especially when sensitive categories are involved. To address these challenges, this study introduces the concept of dataset distillation into the field of remote sensing image interpretation for the first time. Specifically, we train a text-to-image diffusion model to condense a large-scale remote sensing dataset into a compact and representative distilled dataset. To improve the discriminative quality of the synthesized samples, we propose a classifier-driven guidance by injecting a classification consistency loss from a pre-trained model into the diffusion training process. Besides, considering the rich semantic complexity of remote sensing imagery, we further perform latent space clustering on training samples to select representative and diverse prototypes as visual style guidance, while using a visual language model to provide aggregated text descriptions. Experiments on three high-resolution remote sensing scene classification benchmarks show that the proposed method can distill realistic and diverse samples for downstream model training. Code and pre-trained models are available online (https://github.com/YonghaoXu/DPD).

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes