CVNov 15, 2024

DiffFNO: Diffusion Fourier Neural Operator

arXiv:2411.09911v220 citationsh-index: 1CVPR
Originality Highly original
AI Analysis

This addresses the problem of high-quality image super-resolution for computer vision applications, representing a significant but incremental advance over existing methods.

The paper tackles arbitrary-scale super-resolution by introducing DiffFNO, a diffusion framework that combines a Weighted Fourier Neural Operator with spatial features, achieving state-of-the-art results with 2-4 dB PSNR improvements across various scaling factors and lower inference time.

We introduce DiffFNO, a novel diffusion framework for arbitrary-scale super-resolution strengthened by a Weighted Fourier Neural Operator (WFNO). Mode Rebalancing in WFNO effectively captures critical frequency components, significantly improving the reconstruction of high-frequency image details that are crucial for super-resolution tasks. Gated Fusion Mechanism (GFM) adaptively complements WFNO's spectral features with spatial features from an Attention-based Neural Operator (AttnNO). This enhances the network's capability to capture both global structures and local details. Adaptive Time-Step (ATS) ODE solver, a deterministic sampling strategy, accelerates inference without sacrificing output quality by dynamically adjusting integration step sizes ATS. Extensive experiments demonstrate that DiffFNO achieves state-of-the-art (SOTA) results, outperforming existing methods across various scaling factors by a margin of 2-4 dB in PSNR, including those beyond the training distribution. It also achieves this at lower inference time. Our approach sets a new standard in super-resolution, delivering both superior accuracy and computational efficiency.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes