CVJul 17, 2024

GeoGuide: Geometric guidance of diffusion models

arXiv:2407.12889v16 citationsh-index: 16
Originality Incremental advance
AI Analysis

This addresses a problem for researchers and practitioners in generative AI by offering a more effective guidance method for diffusion models, though it is incremental as it builds on existing guidance approaches.

The paper tackles the challenge of guiding pre-trained diffusion models to generate elements from previously unlabeled data, which suffers from a quality gap compared to class-conditioned models, with ADM-G showing nearly three times lower FID scores. The proposed GeoGuide method, based on geometric distance tracing, improves FID scores and image quality over ADM-G.

Diffusion models are among the most effective methods for image generation. This is in particular because, unlike GANs, they can be easily conditioned during training to produce elements with desired class or properties. However, guiding a pre-trained diffusion model to generate elements from previously unlabeled data is significantly more challenging. One of the possible solutions was given by the ADM-G guiding approach. Although ADM-G successfully generates elements from the given class, there is a significant quality gap compared to a model originally conditioned on this class. In particular, the FID score obtained by the ADM-G-guided diffusion model is nearly three times lower than the class-conditioned guidance. We demonstrate that this issue is partly due to ADM-G providing minimal guidance during the final stage of the denoising process. To address this problem, we propose GeoGuide, a guidance model based on tracing the distance of the diffusion model's trajectory from the data manifold. The main idea of GeoGuide is to produce normalized adjustments during the backward denoising process. As shown in the experiments, GeoGuide surpasses the probabilistic approach ADM-G with respect to both the FID scores and the quality of the generated images.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes