CVAIIVAug 1, 2024

Image Super-Resolution with Taylor Expansion Approximation and Large Field Reception

arXiv:2408.00470v11 citationsh-index: 4
Originality Incremental advance
AI Analysis

This work addresses computational bottlenecks in image super-resolution for applications requiring efficient processing, though it is incremental as it builds on existing self-similarity methods.

The paper tackles the high computational cost of self-similarity techniques in blind super-resolution by proposing a second-order Taylor expansion approximation to reduce complexity from O(N^2) to O(N), and a multi-scale large field reception to maintain performance, with LabNet setting new benchmarks on synthetic datasets and RealNet achieving superior visual quality on real-world data.

Self-similarity techniques are booming in blind super-resolution (SR) due to accurate estimation of the degradation types involved in low-resolution images. However, high-dimensional matrix multiplication within self-similarity computation prohibitively consumes massive computational costs. We find that the high-dimensional attention map is derived from the matrix multiplication between Query and Key, followed by a softmax function. This softmax makes the matrix multiplication between Query and Key inseparable, posing a great challenge in simplifying computational complexity. To address this issue, we first propose a second-order Taylor expansion approximation (STEA) to separate the matrix multiplication of Query and Key, resulting in the complexity reduction from $\mathcal{O}(N^2)$ to $\mathcal{O}(N)$. Then, we design a multi-scale large field reception (MLFR) to compensate for the performance degradation caused by STEA. Finally, we apply these two core designs to laboratory and real-world scenarios by constructing LabNet and RealNet, respectively. Extensive experimental results tested on five synthetic datasets demonstrate that our LabNet sets a new benchmark in qualitative and quantitative evaluations. Tested on the RealWorld38 dataset, our RealNet achieves superior visual quality over existing methods. Ablation studies further verify the contributions of STEA and MLFR towards both LabNet and RealNet frameworks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes