CVJun 4, 2021

Hybrid attention network based on progressive embedding scale-context for crowd counting

arXiv:2106.02324v130 citations
Originality Incremental advance
AI Analysis

This work addresses crowd counting for computer vision applications, presenting an incremental improvement by integrating existing attention mechanisms with a novel progressive learning strategy.

The paper tackles the problems of background noise and scale variation in crowd counting by proposing a Hybrid Attention Network (HAN) with Progressive Embedding Scale-context (PES), achieving state-of-the-art performance on four mainstream datasets.

The existing crowd counting methods usually adopted attention mechanism to tackle background noise, or applied multi-level features or multi-scales context fusion to tackle scale variation. However, these approaches deal with these two problems separately. In this paper, we propose a Hybrid Attention Network (HAN) by employing Progressive Embedding Scale-context (PES) information, which enables the network to simultaneously suppress noise and adapt head scale variation. We build the hybrid attention mechanism through paralleling spatial attention and channel attention module, which makes the network to focus more on the human head area and reduce the interference of background objects. Besides, we embed certain scale-context to the hybrid attention along the spatial and channel dimensions for alleviating these counting errors caused by the variation of perspective and head scale. Finally, we propose a progressive learning strategy through cascading multiple hybrid attention modules with embedding different scale-context, which can gradually integrate different scale-context information into the current feature map from global to local. Ablation experiments provides that the network architecture can gradually learn multi-scale features and suppress background noise. Extensive experiments demonstrate that HANet obtain state-of-the-art counting performance on four mainstream datasets.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes