CVApr 16, 2020

Spatially Attentive Output Layer for Image Classification

arXiv:2004.07570v127 citations
AI Analysis

This work addresses the problem of underutilizing spatial information in image classification for researchers and practitioners, offering an incremental improvement through a plug-in module.

The paper tackles the limitation of global average pooling in CNNs for image classification by proposing a spatially attentive output layer (SAOL) that uses location-specific attention to aggregate spatial logits, resulting in consistent performance improvements across various tasks with minimal computational overhead.

Most convolutional neural networks (CNNs) for image classification use a global average pooling (GAP) followed by a fully-connected (FC) layer for output logits. However, this spatial aggregation procedure inherently restricts the utilization of location-specific information at the output layer, although this spatial information can be beneficial for classification. In this paper, we propose a novel spatial output layer on top of the existing convolutional feature maps to explicitly exploit the location-specific output information. In specific, given the spatial feature maps, we replace the previous GAP-FC layer with a spatially attentive output layer (SAOL) by employing a attention mask on spatial logits. The proposed location-specific attention selectively aggregates spatial logits within a target region, which leads to not only the performance improvement but also spatially interpretable outputs. Moreover, the proposed SAOL also permits to fully exploit location-specific self-supervision as well as self-distillation to enhance the generalization ability during training. The proposed SAOL with self-supervision and self-distillation can be easily plugged into existing CNNs. Experimental results on various classification tasks with representative architectures show consistent performance improvements by SAOL at almost the same computational cost.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes