CVJan 17, 2025

DiffStereo: High-Frequency Aware Diffusion Model for Stereo Image Restoration

arXiv:2501.10325v11 citationsh-index: 5
Originality Incremental advance
AI Analysis

This addresses the problem of restoring high-quality stereo images for applications like VR or 3D vision, representing an incremental advance by adapting diffusion models to a new domain.

The paper tackles stereo image restoration by proposing DiffStereo, a high-frequency aware diffusion model that preserves texture details and reduces computational cost, achieving higher reconstruction accuracy and better perceptual quality in tasks like super-resolution, deblurring, and low-light enhancement compared to state-of-the-art methods.

Diffusion models (DMs) have achieved promising performance in image restoration but haven't been explored for stereo images. The application of DM in stereo image restoration is confronted with a series of challenges. The need to reconstruct two images exacerbates DM's computational cost. Additionally, existing latent DMs usually focus on semantic information and remove high-frequency details as redundancy during latent compression, which is precisely what matters for image restoration. To address the above problems, we propose a high-frequency aware diffusion model, DiffStereo for stereo image restoration as the first attempt at DM in this domain. Specifically, DiffStereo first learns latent high-frequency representations (LHFR) of HQ images. DM is then trained in the learned space to estimate LHFR for stereo images, which are fused into a transformer-based stereo image restoration network providing beneficial high-frequency information of corresponding HQ images. The resolution of LHFR is kept the same as input images, which preserves the inherent texture from distortion. And the compression in channels alleviates the computational burden of DM. Furthermore, we devise a position encoding scheme when integrating the LHFR into the restoration network, enabling distinctive guidance in different depths of the restoration network. Comprehensive experiments verify that by combining generative DM and transformer, DiffStereo achieves both higher reconstruction accuracy and better perceptual quality on stereo super-resolution, deblurring, and low-light enhancement compared with state-of-the-art methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes