CVJul 9, 2020

DCANet: Learning Connected Attentions for Convolutional Neural Networks

arXiv:2007.05099v119 citations
Originality Incremental advance
AI Analysis

This work addresses a problem for computer vision researchers and practitioners by enhancing attention mechanisms in CNNs, though it is incremental as it builds on existing attention modules without modifying their internal structure.

The paper tackles the limitation of self-attention mechanisms in vision tasks by proposing DCANet, which interconnects adjacent attention blocks in CNNs to enable joint training and improve attention learning, resulting in consistent outperformance of state-of-the-art attention modules on ImageNet and MS COCO benchmarks with minimal computational overhead.

While self-attention mechanism has shown promising results for many vision tasks, it only considers the current features at a time. We show that such a manner cannot take full advantage of the attention mechanism. In this paper, we present Deep Connected Attention Network (DCANet), a novel design that boosts attention modules in a CNN model without any modification of the internal structure. To achieve this, we interconnect adjacent attention blocks, making information flow among attention blocks possible. With DCANet, all attention blocks in a CNN model are trained jointly, which improves the ability of attention learning. Our DCANet is generic. It is not limited to a specific attention module or base network architecture. Experimental results on ImageNet and MS COCO benchmarks show that DCANet consistently outperforms the state-of-the-art attention modules with a minimal additional computational overhead in all test cases. All code and models are made publicly available.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes