CVAISep 12, 2025

Realism Control One-step Diffusion for Real-World Image Super-Resolution

arXiv:2509.10122v25 citationsh-index: 8
Originality Incremental advance
AI Analysis

This work addresses a key limitation in efficient image restoration for applications requiring high-quality reconstructions, though it is incremental in improving control mechanisms within existing diffusion-based approaches.

The paper tackles the problem of balancing fidelity and realism in one-step diffusion models for real-world image super-resolution by proposing a Realism Controlled One-step Diffusion (RCOD) framework, which achieves superior performance in quantitative metrics and visual quality compared to state-of-the-art methods.

Pre-trained diffusion models have shown great potential in real-world image super-resolution (Real-ISR) tasks by enabling high-resolution reconstructions. While one-step diffusion (OSD) methods significantly improve efficiency compared to traditional multi-step approaches, they still have limitations in balancing fidelity and realism across diverse scenarios. Since the OSDs for SR are usually trained or distilled by a single timestep, they lack flexible control mechanisms to adaptively prioritize these competing objectives, which are inherently manageable in multi-step methods through adjusting sampling steps. To address this challenge, we propose a Realism Controlled One-step Diffusion (RCOD) framework for Real-ISR. RCOD provides a latent domain grouping strategy that enables explicit control over fidelity-realism trade-offs during the noise prediction phase with minimal training paradigm modifications and original training data. A degradation-aware sampling strategy is also introduced to align distillation regularization with the grouping strategy and enhance the controlling of trade-offs. Moreover, a visual prompt injection module is used to replace conventional text prompts with degradation-aware visual tokens, enhancing both restoration accuracy and semantic consistency. Our method achieves superior fidelity and perceptual quality while maintaining computational efficiency. Extensive experiments demonstrate that RCOD outperforms state-of-the-art OSD methods in both quantitative metrics and visual qualities, with flexible realism control capabilities in the inference stage.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes