IV CVOct 23, 2024

A Wavelet Diffusion GAN for Image Super-Resolution

Lorenzo Aloisi, Luigi Sigillo, Aurelio Uncini, Danilo Comminiello

arXiv:2410.17966v213.36 citationsh-index: 38

Originality Incremental advance

AI Analysis

This work addresses the real-time feasibility problem for time-sensitive applications like image super-resolution, representing an incremental improvement by combining existing methods.

The paper tackles the slow training and inference speeds of diffusion models for image super-resolution by proposing a wavelet-based conditional Diffusion GAN scheme, which reduces timesteps and dimensionality to achieve faster performance while maintaining high-fidelity output, as validated on the CelebA-HQ dataset.

In recent years, diffusion models have emerged as a superior alternative to generative adversarial networks (GANs) for high-fidelity image generation, with wide applications in text-to-image generation, image-to-image translation, and super-resolution. However, their real-time feasibility is hindered by slow training and inference speeds. This study addresses this challenge by proposing a wavelet-based conditional Diffusion GAN scheme for Single-Image Super-Resolution (SISR). Our approach utilizes the diffusion GAN paradigm to reduce the timesteps required by the reverse diffusion process and the Discrete Wavelet Transform (DWT) to achieve dimensionality reduction, decreasing training and inference times significantly. The results of an experimental validation on the CelebA-HQ dataset confirm the effectiveness of our proposed scheme. Our approach outperforms other state-of-the-art methodologies successfully ensuring high-fidelity output while overcoming inherent drawbacks associated with diffusion models in time-sensitive applications.

View on arXiv PDF

Similar