CV · AI · May 24, 2025

Restoring Real-World Images with an Internal Detail Enhancement Diffusion Model

arXiv:2505.18674v2 · h-index: 7
Originality: Incremental advance
AI Analysis

The paper addresses the challenge of high-fidelity restoration with object-level control, with applications such as old photo restoration, but the advance is incremental because it builds on existing diffusion models.

The paper tackles the restoration of real-world degraded images with complex mixed degradations by proposing an internal detail-preserving diffusion model, which the authors report significantly outperforms state-of-the-art models in qualitative and perceptual quantitative evaluations.

Restoring real-world degraded images, such as old photographs or low-resolution images, presents a significant challenge due to the complex, mixed degradations they exhibit, such as scratches, color fading, and noise. Recent data-driven approaches have struggled with two main challenges: achieving high-fidelity restoration and providing object-level control over colorization. While diffusion models have shown promise in generating high-quality images with specific controls, they often fail to fully preserve image details during restoration. In this work, we propose an internal detail-preserving diffusion model for high-fidelity restoration of real-world degraded images. Our method utilizes a pre-trained Stable Diffusion model as a generative prior, eliminating the need to train a model from scratch. Central to our approach is the Internal Image Detail Enhancement (IIDE) technique, which directs the diffusion model to preserve essential structural and textural information while mitigating degradation effects. The process starts by mapping the input image into a latent space, where we inject the diffusion denoising process with degradation operations that simulate the effects of various degradation factors. Extensive experiments demonstrate that our method significantly outperforms state-of-the-art models in both qualitative assessments and perceptual quantitative evaluations. Additionally, our approach supports text-guided restoration, enabling object-level colorization control that mimics the expertise of professional photo editing.
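To make the "complex, mixed degradations" named in the abstract concrete, here is a minimal sketch that composes color fading, noise, and scratches on a toy image. It is an illustration only and not the authors' pipeline: the abstract describes degradation operations injected into the denoising process in latent space, whereas this example works in pixel space using only the standard library. All function names and parameters (`fade_color`, `add_noise`, `add_scratches`, `degrade`, and their defaults) are hypothetical.

```python
import random

Pixel = tuple[float, float, float]
Image = list[list[Pixel]]  # rows of RGB pixels, channel values in [0, 1]


def clamp(v: float) -> float:
    """Keep a channel value inside [0, 1]."""
    return max(0.0, min(1.0, v))


def fade_color(img: Image, strength: float) -> Image:
    """Blend each pixel toward its gray value; strength=1 gives grayscale."""
    out = []
    for row in img:
        new_row = []
        for r, g, b in row:
            gray = (r + g + b) / 3.0
            new_row.append(tuple(c + strength * (gray - c) for c in (r, g, b)))
        out.append(new_row)
    return out


def add_noise(img: Image, sigma: float, rng: random.Random) -> Image:
    """Add zero-mean Gaussian noise to every channel, then clamp."""
    return [
        [tuple(clamp(c + rng.gauss(0.0, sigma)) for c in px) for px in row]
        for row in img
    ]


def add_scratches(img: Image, count: int, rng: random.Random) -> Image:
    """Overwrite `count` randomly chosen columns with white vertical lines."""
    out = [list(row) for row in img]
    width = len(img[0])
    for _ in range(count):
        col = rng.randrange(width)  # the same column may be picked twice
        for row in out:
            row[col] = (1.0, 1.0, 1.0)
    return out


def degrade(img: Image, fade: float = 0.7, sigma: float = 0.05,
            scratches: int = 2, seed: int = 0) -> Image:
    """Compose the three degradations in a fixed, assumed order."""
    rng = random.Random(seed)
    faded = fade_color(img, fade)
    noisy = add_noise(faded, sigma, rng)
    return add_scratches(noisy, scratches, rng)


# Toy 8x8 image with a single reddish color.
clean = [[(0.8, 0.2, 0.1)] * 8 for _ in range(8)]
degraded = degrade(clean)
```

A restoration model sees something like `degraded` and must recover `clean`; the difficulty the abstract points to is that the three effects interact, so removing one without accounting for the others tends to destroy detail.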

