CVApr 21, 2025

Distribution-aware Dataset Distillation for Efficient Image Restoration

arXiv:2504.14826v11 citationsh-index: 7
Originality Incremental advance
AI Analysis

This addresses the computational burden for researchers and practitioners in image restoration, though it is incremental as it extends dataset distillation to a new domain.

The paper tackles the problem of laborious training for image restoration models by proposing a distribution-aware dataset distillation method that selects a small subset of images based on complexity and fine-tunes their distribution, achieving state-of-the-art performance with 500x savings in computing resources and training in under 8 hours on a consumer GPU.

With the exponential increase in image data, training an image restoration model is laborious. Dataset distillation is a potential solution to this problem, yet current distillation techniques are a blank canvas in the field of image restoration. To fill this gap, we propose the Distribution-aware Dataset Distillation method (TripleD), a new framework that extends the principles of dataset distillation to image restoration. Specifically, TripleD uses a pre-trained vision Transformer to extract features from images for complexity evaluation, and the subset (the number of samples is much smaller than the original training set) is selected based on complexity. The selected subset is then fed through a lightweight CNN that fine-tunes the image distribution to align with the distribution of the original dataset at the feature level. To efficiently condense knowledge, the training is divided into two stages. Early stages focus on simpler, low-complexity samples to build foundational knowledge, while later stages select more complex and uncertain samples as the model matures. Our method achieves promising performance on multiple image restoration tasks, including multi-task image restoration, all-in-one image restoration, and ultra-high-definition image restoration tasks. Note that we can train a state-of-the-art image restoration model on an ultra-high-definition (4K resolution) dataset using only one consumer-grade GPU in less than 8 hours (500 savings in computing resources and immeasurable training time).

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes