CVLGIVDec 7, 2024

Enhancing Sample Generation of Diffusion Models using Noise Level Correction

CMU
arXiv:2412.05488v34 citationsh-index: 14Trans. Mach. Learn. Res.
Originality Incremental advance
AI Analysis

This work addresses a specific bottleneck in diffusion models for image generation and restoration, offering incremental improvements in sample quality.

The paper tackled the problem of enhancing sample generation in diffusion models by aligning estimated noise levels with true distances to the data manifold, resulting in significant improvements in sample quality for both unconstrained and constrained generation scenarios.

The denoising process of diffusion models can be interpreted as an approximate projection of noisy samples onto the data manifold. Moreover, the noise level in these samples approximates their distance to the underlying manifold. Building on this insight, we propose a novel method to enhance sample generation by aligning the estimated noise level with the true distance of noisy samples to the manifold. Specifically, we introduce a noise level correction network, leveraging a pre-trained denoising network, to refine noise level estimates during the denoising process. Additionally, we extend this approach to various image restoration tasks by integrating task-specific constraints, including inpainting, deblurring, super-resolution, colorization, and compressed sensing. Experimental results demonstrate that our method significantly improves sample quality in both unconstrained and constrained generation scenarios. Notably, the proposed noise level correction framework is compatible with existing denoising schedulers (e.g., DDIM), offering additional performance improvements.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes