CVNov 22, 2022

SinDiffusion: Learning a Diffusion Model from a Single Natural Image

arXiv:2211.12445v166 citationsh-index: 68
Originality Incremental advance
AI Analysis

This work addresses the challenge of single-image generation for applications like text-guided generation and outpainting, representing an incremental advance in diffusion model design for this specific task.

The authors tackled the problem of generating diverse and high-quality images from a single natural image by developing SinDiffusion, a diffusion model that captures internal patch distributions, resulting in significant improvements over GAN-based methods in sample quality and diversity.

We present SinDiffusion, leveraging denoising diffusion models to capture internal distribution of patches from a single natural image. SinDiffusion significantly improves the quality and diversity of generated samples compared with existing GAN-based approaches. It is based on two core designs. First, SinDiffusion is trained with a single model at a single scale instead of multiple models with progressive growing of scales which serves as the default setting in prior work. This avoids the accumulation of errors, which cause characteristic artifacts in generated results. Second, we identify that a patch-level receptive field of the diffusion network is crucial and effective for capturing the image's patch statistics, therefore we redesign the network structure of the diffusion model. Coupling these two designs enables us to generate photorealistic and diverse images from a single image. Furthermore, SinDiffusion can be applied to various applications, i.e., text-guided image generation, and image outpainting, due to the inherent capability of diffusion models. Extensive experiments on a wide range of images demonstrate the superiority of our proposed method for modeling the patch distribution.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes