CVIVJul 3, 2023

ACDMSR: Accelerated Conditional Diffusion Models for Single Image Super-Resolution

arXiv:2307.00781v167 citationsh-index: 69
Originality Incremental advance
AI Analysis

This work addresses a practical bottleneck in applying diffusion models to image super-resolution, offering faster inference and improved performance for tasks requiring high-quality image upscaling.

The paper tackles the slow inference speed of diffusion models for single image super-resolution by proposing ACDMSR, which uses a deterministic iterative denoising process and a pre-trained SR model for conditioning, achieving superior qualitative and quantitative results on benchmark datasets like Set5 and Urban100.

Diffusion models have gained significant popularity in the field of image-to-image translation. Previous efforts applying diffusion models to image super-resolution (SR) have demonstrated that iteratively refining pure Gaussian noise using a U-Net architecture trained on denoising at various noise levels can yield satisfactory high-resolution images from low-resolution inputs. However, this iterative refinement process comes with the drawback of low inference speed, which strongly limits its applications. To speed up inference and further enhance the performance, our research revisits diffusion models in image super-resolution and proposes a straightforward yet significant diffusion model-based super-resolution method called ACDMSR (accelerated conditional diffusion model for image super-resolution). Specifically, our method adapts the standard diffusion model to perform super-resolution through a deterministic iterative denoising process. Our study also highlights the effectiveness of using a pre-trained SR model to provide the conditional image of the given low-resolution (LR) image to achieve superior high-resolution results. We demonstrate that our method surpasses previous attempts in qualitative and quantitative results through extensive experiments conducted on benchmark datasets such as Set5, Set14, Urban100, BSD100, and Manga109. Moreover, our approach generates more visually realistic counterparts for low-resolution images, emphasizing its effectiveness in practical scenarios.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes