CVAIMar 26

FD$^2$: A Dedicated Framework for Fine-Grained Dataset Distillation

arXiv:2603.2514475.71 citationsh-index: 26
Predicted impact top 35% in CV · last 90 daysOriginality Incremental advance
AI Analysis

This work addresses a domain-specific bottleneck in dataset distillation for fine-grained recognition tasks, offering incremental improvements over prior decoupled methods.

The paper tackles the problem of dataset distillation for fine-grained datasets, where existing methods produce overly similar intra-class samples and fail to capture subtle inter-class differences, and proposes FD^2, which improves performance on multiple fine-grained and general datasets.

Dataset distillation (DD) compresses a large training set into a small synthetic set, reducing storage and training cost, and has shown strong results on general benchmarks. Decoupled DD further improves efficiency by splitting the pipeline into pretraining, sample distillation, and soft-label generation. However, existing decoupled methods largely rely on coarse class-label supervision and optimize samples within each class in a nearly identical manner. On fine-grained datasets, this often yields distilled samples that (i) retain large intra-class variation with subtle inter-class differences and (ii) become overly similar within the same class, limiting localized discriminative cues and hurting recognition. To solve the above-mentioned problems, we propose FD$^{2}$, a dedicated framework for Fine-grained Dataset Distillation. FD$^{2}$ localizes discriminative regions and constructs fine-grained representations for distillation. During pretraining, counterfactual attention learning aggregates discriminative representations to update class prototypes. During distillation, a fine-grained characteristic constraint aligns each sample with its class prototype while repelling others, and a similarity constraint diversifies attention across same-class samples. Experiments on multiple fine-grained and general datasets show that FD$^{2}$ integrates seamlessly with decoupled DD and improves performance in most settings, indicating strong transferability.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes