CVLGIVJul 27, 2021

AASeg: Attention Aware Network for Real Time Semantic Segmentation

arXiv:2108.04349v41 citations
Originality Incremental advance
AI Analysis

It addresses the problem of efficient scene understanding for resource-constrained applications, representing an incremental improvement over prior real-time methods.

The paper tackles the trade-off between accuracy and real-time performance in semantic segmentation by proposing AASeg, which uses lightweight attention and multi-scale context modules, achieving improved results on benchmarks like Cityscapes and ADE20K.

Semantic segmentation is a fundamental task in computer vision that involves dense pixel-wise classification for scene understanding. Despite significant progress, achieving high accuracy while maintaining real-time performance remains a challenging trade-off, particularly for deployment in resource-constrained or latency-sensitive applications. In this paper, we propose AASeg, a novel Attention-Aware Network for real-time semantic segmentation. AASeg effectively captures both spatial and channel-wise dependencies through lightweight Spatial Attention (SA) and Channel Attention (CA) modules, enabling enhanced feature discrimination without incurring significant computational overhead. To enrich contextual representation, we introduce a Multi-Scale Context (MSC) module that aggregates dense local features across multiple receptive fields. The outputs from attention and context modules are adaptively fused to produce high-resolution segmentation maps. Extensive experiments on Cityscapes, ADE20K, and CamVid demonstrate that AASeg achieves a compelling trade-off between accuracy and efficiency, outperforming prior real-time methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes