CVIVJul 26, 2024

Dilated Strip Attention Network for Image Restoration

arXiv:2407.18613v1h-index: 8
Originality Incremental advance
AI Analysis

This work addresses efficiency and effectiveness challenges in image restoration for computer vision applications, representing an incremental improvement over existing attention-based methods.

The paper tackles the problem of limited receptive fields and high parameter counts in attention-based image restoration methods by proposing a Dilated Strip Attention Network (DSAN), which outperforms state-of-the-art algorithms on multiple tasks.

Image restoration is a long-standing task that seeks to recover the latent sharp image from its deteriorated counterpart. Due to the robust capacity of self-attention to capture long-range dependencies, transformer-based methods or some attention-based convolutional neural networks have demonstrated promising results on many image restoration tasks in recent years. However, existing attention modules encounters limited receptive fields or abundant parameters. In order to integrate contextual information more effectively and efficiently, in this paper, we propose a dilated strip attention network (DSAN) for image restoration. Specifically, to gather more contextual information for each pixel from its neighboring pixels in the same row or column, a dilated strip attention (DSA) mechanism is elaborately proposed. By employing the DSA operation horizontally and vertically, each location can harvest the contextual information from a much wider region. In addition, we utilize multi-scale receptive fields across different feature groups in DSA to improve representation learning. Extensive experiments show that our DSAN outperforms state-of-the-art algorithms on several image restoration tasks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes