CVIVApr 14, 2022

Residual Swin Transformer Channel Attention Network for Image Demosaicing

arXiv:2204.07098v120 citationsh-index: 54
Originality Incremental advance
AI Analysis

This is an incremental improvement for image processing applications, enhancing demosaicing performance with a more efficient model.

The paper tackles image demosaicing by proposing RSTCANet, a Swin Transformer-based network with residual channel attention blocks, which outperforms state-of-the-art methods and has fewer parameters.

Image demosaicing is problem of interpolating full- resolution color images from raw sensor (color filter array) data. During last decade, deep neural networks have been widely used in image restoration, and in particular, in demosaicing, attaining significant performance improvement. In recent years, vision transformers have been designed and successfully used in various computer vision applications. One of the recent methods of image restoration based on a Swin Transformer (ST), SwinIR, demonstrates state-of-the-art performance with a smaller number of parameters than neural network-based methods. Inspired by the success of SwinIR, we propose in this paper a novel Swin Transformer-based network for image demosaicing, called RSTCANet. To extract image features, RSTCANet stacks several residual Swin Transformer Channel Attention blocks (RSTCAB), introducing the channel attention for each two successive ST blocks. Extensive experiments demonstrate that RSTCANet out- performs state-of-the-art image demosaicing methods, and has a smaller number of parameters.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes