CVLGJun 13, 2021

DMSANet: Dual Multi Scale Attention Network

arXiv:2106.08382v228 citations
Originality Incremental advance
AI Analysis

This addresses efficiency issues in attention-based vision models for researchers and practitioners, though it appears incremental as it builds on existing attention paradigms.

The paper tackles the problem of attention mechanisms in computer vision increasing computational complexity by proposing DMSANet, a dual multi-scale attention network that achieves state-of-the-art performance with fewer parameters on ImageNet classification and MS COCO object detection/segmentation.

Attention mechanism of late has been quite popular in the computer vision community. A lot of work has been done to improve the performance of the network, although almost always it results in increased computational complexity. In this paper, we propose a new attention module that not only achieves the best performance but also has lesser parameters compared to most existing models. Our attention module can easily be integrated with other convolutional neural networks because of its lightweight nature. The proposed network named Dual Multi Scale Attention Network (DMSANet) is comprised of two parts: the first part is used to extract features at various scales and aggregate them, the second part uses spatial and channel attention modules in parallel to adaptively integrate local features with their global dependencies. We benchmark our network performance for Image Classification on ImageNet dataset, Object Detection and Instance Segmentation both on MS COCO dataset.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes