CVNov 26, 2024

Large-Scale Data-Free Knowledge Distillation for ImageNet via Multi-Resolution Data Generation

Minh-Tuan Tran, Trung Le, Xuan-May Le, Jianfei Cai, Mehrtash Harandi, Dinh Phung

arXiv:2411.17046v16.54 citationsh-index: 29Has Code

Originality Incremental advance

AI Analysis

This addresses the problem of computational cost and noisy image generation in data-free knowledge distillation for large datasets, though it appears incremental as it builds on existing DFKD methods with novel optimizations.

The paper tackles the challenge of data-free knowledge distillation on large-scale datasets like ImageNet by proposing MUSE, which generates synthetic images at lower resolutions using Class Activation Maps to preserve class-specific features, achieving state-of-the-art performance with gains of up to two digits in ImageNet experiments.

Data-Free Knowledge Distillation (DFKD) is an advanced technique that enables knowledge transfer from a teacher model to a student model without relying on original training data. While DFKD methods have achieved success on smaller datasets like CIFAR10 and CIFAR100, they encounter challenges on larger, high-resolution datasets such as ImageNet. A primary issue with previous approaches is their generation of synthetic images at high resolutions (e.g., $224 \times 224$) without leveraging information from real images, often resulting in noisy images that lack essential class-specific features in large datasets. Additionally, the computational cost of generating the extensive data needed for effective knowledge transfer can be prohibitive. In this paper, we introduce MUlti-reSolution data-freE (MUSE) to address these limitations. MUSE generates images at lower resolutions while using Class Activation Maps (CAMs) to ensure that the generated images retain critical, class-specific features. To further enhance model diversity, we propose multi-resolution generation and embedding diversity techniques that strengthen latent space representations, leading to significant performance improvements. Experimental results demonstrate that MUSE achieves state-of-the-art performance across both small- and large-scale datasets, with notable performance gains of up to two digits in nearly all ImageNet and subset experiments. Code is available at https://github.com/tmtuan1307/muse.

View on arXiv PDF Code

Similar