CVIVMar 30, 2023

KD-DLGAN: Data Limited Image Generation via Knowledge Distillation

arXiv:2303.17158v129 citationsh-index: 51
Originality Incremental advance
AI Analysis

This addresses a practical limitation in generative modeling for applications with scarce data, offering incremental improvements over existing methods.

The paper tackles the problem of training GANs with limited data, which causes discriminator overfitting and reduced generation diversity, by proposing KD-DLGAN, a knowledge distillation framework that uses pre-trained vision-language models to improve generation quality and diversity, achieving superior performance on multiple benchmarks.

Generative Adversarial Networks (GANs) rely heavily on large-scale training data for training high-quality image generation models. With limited training data, the GAN discriminator often suffers from severe overfitting which directly leads to degraded generation especially in generation diversity. Inspired by the recent advances in knowledge distillation (KD), we propose KD-DLGAN, a knowledge-distillation based generation framework that introduces pre-trained vision-language models for training effective data-limited generation models. KD-DLGAN consists of two innovative designs. The first is aggregated generative KD that mitigates the discriminator overfitting by challenging the discriminator with harder learning tasks and distilling more generalizable knowledge from the pre-trained models. The second is correlated generative KD that improves the generation diversity by distilling and preserving the diverse image-text correlation within the pre-trained models. Extensive experiments over multiple benchmarks show that KD-DLGAN achieves superior image generation with limited training data. In addition, KD-DLGAN complements the state-of-the-art with consistent and substantial performance gains.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes