LGAICVDec 20, 2023

Adaptive Training Meets Progressive Scaling: Elevating Efficiency in Diffusion Models

arXiv:2312.13307v33 citationsh-index: 19ICME
Originality Incremental advance
AI Analysis

This work addresses training conflicts in diffusion models for generative tasks, offering an incremental efficiency gain.

The paper tackles the inefficiency of uniform denoising models in diffusion models by proposing TDC Training, a two-stage strategy that groups timesteps and customizes models, resulting in a 1.5 FID improvement on ImageNet64 and 20% computational savings.

Diffusion models have demonstrated remarkable efficacy in various generative tasks with the predictive prowess of denoising model. Currently, diffusion models employ a uniform denoising model across all timesteps. However, the inherent variations in data distributions at different timesteps lead to conflicts during training, constraining the potential of diffusion models. To address this challenge, we propose a novel two-stage divide-and-conquer training strategy termed TDC Training. It groups timesteps based on task similarity and difficulty, assigning highly customized denoising models to each group, thereby enhancing the performance of diffusion models. While two-stage training avoids the need to train each model separately, the total training cost is even lower than training a single unified denoising model. Additionally, we introduce Proxy-based Pruning to further customize the denoising models. This method transforms the pruning problem of diffusion models into a multi-round decision-making problem, enabling precise pruning of diffusion models. Our experiments validate the effectiveness of TDC Training, demonstrating improvements in FID of 1.5 on ImageNet64 compared to original IDDPM, while saving about 20\% of computational resources.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes