IVCVFeb 29, 2024

CAMixerSR: Only Details Need More "Attention"

arXiv:2402.19289v279 citationsh-index: 5CVPR
AI Analysis

This work addresses the problem of efficient high-quality image super-resolution for applications requiring large or complex images, representing an incremental improvement by combining existing schemes.

The paper tackles the challenge of improving quality-complexity trade-offs in large-image super-resolution by integrating content-aware routing and token mixer refining into a content-aware mixer (CAMixer), which assigns convolution for simple contexts and deformable window-attention for sparse textures, achieving superior performance across tasks like large-image, lightweight, and omnidirectional-image SR.

To satisfy the rapidly increasing demands on the large image (2K-8K) super-resolution (SR), prevailing methods follow two independent tracks: 1) accelerate existing networks by content-aware routing, and 2) design better super-resolution networks via token mixer refining. Despite directness, they encounter unavoidable defects (e.g., inflexible route or non-discriminative processing) limiting further improvements of quality-complexity trade-off. To erase the drawbacks, we integrate these schemes by proposing a content-aware mixer (CAMixer), which assigns convolution for simple contexts and additional deformable window-attention for sparse textures. Specifically, the CAMixer uses a learnable predictor to generate multiple bootstraps, including offsets for windows warping, a mask for classifying windows, and convolutional attentions for endowing convolution with the dynamic property, which modulates attention to include more useful textures self-adaptively and improves the representation capability of convolution. We further introduce a global classification loss to improve the accuracy of predictors. By simply stacking CAMixers, we obtain CAMixerSR which achieves superior performance on large-image SR, lightweight SR, and omnidirectional-image SR.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes