CV AI LGDec 30, 2022

Image Embedding for Denoising Generative Models

Andrea Asperti, Davide Evangelista, Samuele Marro, Fabio Merizzi

arXiv:2301.07485v18.414 citationsh-index: 29Has Code

Originality Incremental advance

AI Analysis

This addresses image manipulation and editing for generative modeling, but it is incremental as it builds on existing diffusion models.

The paper tackles the problem of embedding an image into the latent space of Denoising Diffusion Implicit Models to find a noisy image that denoises to the original, revealing insights into the latent structure and independence from network implementations.

Denoising Diffusion models are gaining increasing popularity in the field of generative modeling for several reasons, including the simple and stable training, the excellent generative quality, and the solid probabilistic foundation. In this article, we address the problem of {\em embedding} an image into the latent space of Denoising Diffusion Models, that is finding a suitable ``noisy'' image whose denoising results in the original image. We particularly focus on Denoising Diffusion Implicit Models due to the deterministic nature of their reverse diffusion process. As a side result of our investigation, we gain a deeper insight into the structure of the latent space of diffusion models, opening interesting perspectives on its exploration, the definition of semantic trajectories, and the manipulation/conditioning of encodings for editing purposes. A particularly interesting property highlighted by our research, which is also characteristic of this class of generative models, is the independence of the latent representation from the networks implementing the reverse diffusion process. In other words, a common seed passed to different networks (each trained on the same dataset), eventually results in identical images.

View on arXiv PDF Code

Similar