CVLGMar 8, 2024

Improving Diffusion-Based Generative Models via Approximated Optimal Transport

arXiv:2403.05069v16 citationsh-index: 14
Originality Highly original
AI Analysis

This work addresses the challenge of efficient and high-quality image generation for AI and machine learning applications, representing a strong incremental improvement in diffusion model training.

The paper tackles the problem of improving diffusion-based generative models by introducing the Approximated Optimal Transport (AOT) technique, which enhances denoiser output estimation, leading to lower curvature ODE trajectories and reduced sampling steps, achieving FID scores as low as 1.58 with 29 NFEs in conditional generation.

We introduce the Approximated Optimal Transport (AOT) technique, a novel training scheme for diffusion-based generative models. Our approach aims to approximate and integrate optimal transport into the training process, significantly enhancing the ability of diffusion models to estimate the denoiser outputs accurately. This improvement leads to ODE trajectories of diffusion models with lower curvature and reduced truncation errors during sampling. We achieve superior image quality and reduced sampling steps by employing AOT in training. Specifically, we achieve FID scores of 1.88 with just 27 NFEs and 1.73 with 29 NFEs in unconditional and conditional generations, respectively. Furthermore, when applying AOT to train the discriminator for guidance, we establish new state-of-the-art FID scores of 1.68 and 1.58 for unconditional and conditional generations, respectively, each with 29 NFEs. This outcome demonstrates the effectiveness of AOT in enhancing the performance of diffusion models.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes