LGFeb 9, 2025

Devil is in the Details: Density Guidance for Detail-Aware Generation with Flow Models

arXiv:2502.05807v27 citationsh-index: 5Has CodeICML
Originality Highly original
AI Analysis

This work addresses the problem of generating high-quality and detailed images for computer vision and graphics applications, providing an incremental improvement over existing techniques.

The authors tackled the problem of controlling sample density in diffusion models to balance realism and detail, and achieved fine-grained control over image detail without compromising sample quality. Their techniques enabled exact log-density control during sampling.

Diffusion models have emerged as a powerful class of generative models, capable of producing high-quality images by mapping noise to a data distribution. However, recent findings suggest that image likelihood does not align with perceptual quality: high-likelihood samples tend to be smooth, while lower-likelihood ones are more detailed. Controlling sample density is thus crucial for balancing realism and detail. In this paper, we analyze an existing technique, Prior Guidance, which scales the latent code to influence image detail. We introduce score alignment, a condition that explains why this method works and show that it can be tractably checked for any continuous normalizing flow model. We then propose Density Guidance, a principled modification of the generative ODE that enables exact log-density control during sampling. Finally, we extend Density Guidance to stochastic sampling, ensuring precise log-density control while allowing controlled variation in structure or fine details. Our experiments demonstrate that these techniques provide fine-grained control over image detail without compromising sample quality. Code is available at https://github.com/Aalto-QuML/density-guidance.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes