IVCVAug 8, 2024

Efficient Single Image Super-Resolution with Entropy Attention and Receptive Field Augmentation

arXiv:2408.04158v16 citationsh-index: 4
Originality Incremental advance
AI Analysis

This work addresses the practical deployment challenge of lightweight super-resolution models for applications requiring fast processing, though it is incremental in improving efficiency over existing methods.

The paper tackles the computational inefficiency of Transformer-based models for single image super-resolution by introducing an efficient network (EARFA) that uses entropy attention and shifting large kernel attention, achieving comparable SR performance with significantly reduced inference delay.

Transformer-based deep models for single image super-resolution (SISR) have greatly improved the performance of lightweight SISR tasks in recent years. However, they often suffer from heavy computational burden and slow inference due to the complex calculation of multi-head self-attention (MSA), seriously hindering their practical application and deployment. In this work, we present an efficient SR model to mitigate the dilemma between model efficiency and SR performance, which is dubbed Entropy Attention and Receptive Field Augmentation network (EARFA), and composed of a novel entropy attention (EA) and a shifting large kernel attention (SLKA). From the perspective of information theory, EA increases the entropy of intermediate features conditioned on a Gaussian distribution, providing more informative input for subsequent reasoning. On the other hand, SLKA extends the receptive field of SR models with the assistance of channel shifting, which also favors to boost the diversity of hierarchical features. Since the implementation of EA and SLKA does not involve complex computations (such as extensive matrix multiplications), the proposed method can achieve faster nonlinear inference than Transformer-based SR models while maintaining better SR performance. Extensive experiments show that the proposed model can significantly reduce the delay of model inference while achieving the SR performance comparable with other advanced models.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes