CVOct 18, 2024

ControlSR: Taming Diffusion Models for Consistent Real-World Image Super Resolution

arXiv:2410.14279v23 citationsh-index: 16Has Code
Originality Incremental advance
AI Analysis

This work addresses inconsistency issues in image super-resolution for applications like photography and computer vision, representing an incremental improvement over prior methods.

The paper tackles the problem of inconsistent content in real-world image super-resolution by taming diffusion models to better utilize low-resolution information, resulting in more consistent and clearer outputs with improved performance across multiple metrics.

We present ControlSR, a new method that can tame Diffusion Models for consistent real-world image super-resolution (Real-ISR). Previous Real-ISR models mostly focus on how to activate more generative priors of text-to-image diffusion models to make the output high-resolution (HR) images look better. However, since these methods rely too much on the generative priors, the content of the output images is often inconsistent with the input LR ones. To mitigate the above issue, in this work, we tame Diffusion Models by effectively utilizing LR information to impose stronger constraints on the control signals from ControlNet in the latent space. We show that our method can produce higher-quality control signals, which enables the super-resolution results to be more consistent with the LR image and leads to clearer visual results. In addition, we also propose an inference strategy that imposes constraints in the latent space using LR information, allowing for the simultaneous improvement of fidelity and generative ability. Experiments demonstrate that our model can achieve better performance across multiple metrics on several test sets and generate more consistent SR results with LR images than existing methods. Our code is available at https://github.com/HVision-NKU/ControlSR.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes