CVLGOct 24, 2022

High-Resolution Image Editing via Multi-Stage Blended Diffusion

arXiv:2210.12965v117 citationsh-index: 9
Originality Incremental advance
AI Analysis

This work addresses a computational bottleneck for image editing applications, offering an incremental improvement over current diffusion-based techniques.

The paper tackles the problem of high-resolution image editing with diffusion models, which are computationally limited to low resolutions, by proposing a multi-stage approach that combines Blended Diffusion with super-resolution upscaling to achieve higher visual fidelity and better global consistency than existing methods.

Diffusion models have shown great results in image generation and in image editing. However, current approaches are limited to low resolutions due to the computational cost of training diffusion models for high-resolution generation. We propose an approach that uses a pre-trained low-resolution diffusion model to edit images in the megapixel range. We first use Blended Diffusion to edit the image at a low resolution, and then upscale it in multiple stages, using a super-resolution model and Blended Diffusion. Using our approach, we achieve higher visual fidelity than by only applying off the shelf super-resolution methods to the output of the diffusion model. We also obtain better global consistency than directly using the diffusion model at a higher resolution.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes