CVJan 24, 2023

Image Super-Resolution using Efficient Striped Window Transformer

arXiv:2301.09869v216 citationsh-index: 131
AI Analysis

This work addresses the problem of efficient super-resolution for applications requiring lightweight models, though it is incremental in improving transformer-based methods.

The paper tackles the challenge of balancing performance and complexity in lightweight image super-resolution by proposing an Efficient Striped Window Transformer, which outperforms state-of-the-art methods with fewer parameters, faster inference, smaller FLOPs, and less memory consumption.

Transformers have achieved remarkable results in single-image super-resolution (SR). However, the challenge of balancing model performance and complexity has hindered their application in lightweight SR (LSR). To tackle this challenge, we propose an efficient striped window transformer (ESWT). We revisit the normalization layer in the transformer and design a concise and efficient transformer structure to build the ESWT. Furthermore, we introduce a striped window mechanism to model long-term dependencies more efficiently. To fully exploit the potential of the ESWT, we propose a novel flexible window training strategy that can improve the performance of the ESWT without additional cost. Extensive experiments show that ESWT outperforms state-of-the-art LSR transformers, and achieves a better trade-off between model performance and complexity. The ESWT requires fewer parameters, incurs faster inference, smaller FLOPs, and less memory consumption, making it a promising solution for LSR.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes