CVNov 1, 2021

Dense Prediction with Attentive Feature Aggregation

arXiv:2111.00770v39 citations
Originality Highly original
AI Analysis

This work addresses a bottleneck in dense prediction tasks like semantic segmentation and boundary detection for computer vision applications, offering a novel method that is broadly applicable with minimal overhead.

The paper tackles the problem of limited expressiveness in feature aggregation for dense prediction models by introducing Attentive Feature Aggregation (AFA), which uses spatial and channel attention to fuse network layers, resulting in significant improvements such as a nearly 6% mIoU increase on Cityscapes and new state-of-the-art results on boundary detection benchmarks.

Aggregating information from features across different layers is an essential operation for dense prediction models. Despite its limited expressiveness, feature concatenation dominates the choice of aggregation operations. In this paper, we introduce Attentive Feature Aggregation (AFA) to fuse different network layers with more expressive non-linear operations. AFA exploits both spatial and channel attention to compute weighted average of the layer activations. Inspired by neural volume rendering, we extend AFA with Scale-Space Rendering (SSR) to perform late fusion of multi-scale predictions. AFA is applicable to a wide range of existing network designs. Our experiments show consistent and significant improvements on challenging semantic segmentation benchmarks, including Cityscapes, BDD100K, and Mapillary Vistas, at negligible computational and parameter overhead. In particular, AFA improves the performance of the Deep Layer Aggregation (DLA) model by nearly 6% mIoU on Cityscapes. Our experimental analyses show that AFA learns to progressively refine segmentation maps and to improve boundary details, leading to new state-of-the-art results on boundary detection benchmarks on BSDS500 and NYUDv2. Code and video resources are available at http://vis.xyz/pub/dla-afa.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes