CVDec 28, 2023

Scale-Aware Crowd Count Network with Annotation Error Correction

arXiv:2312.16771v11 citationsh-index: 4
Originality Incremental advance
AI Analysis

This work addresses crowd counting accuracy in computer vision, which is important for applications like surveillance and event management, but it appears incremental as it builds on existing methods by integrating error correction and scale awareness.

The paper tackled the problem of inaccurate crowd counting due to information loss from pooling layers and noisy annotations by proposing SACC-Net, which models labeling errors and scale variations with spatially-varying Gaussian distributions, achieving state-of-the-art performance on four public datasets.

Traditional crowd counting networks suffer from information loss when feature maps are downsized through pooling layers, leading to inaccuracies in counting crowds at a distance. Existing methods often assume correct annotations during training, disregarding the impact of noisy annotations, especially in crowded scenes. Furthermore, the use of a fixed Gaussian kernel fails to account for the varying pixel distribution with respect to the camera distance. To overcome these challenges, we propose a Scale-Aware Crowd Counting Network (SACC-Net) that introduces a ``scale-aware'' architecture with error-correcting capabilities of noisy annotations. For the first time, we {\bf simultaneously} model labeling errors (mean) and scale variations (variance) by spatially-varying Gaussian distributions to produce fine-grained heat maps for crowd counting. Furthermore, the proposed adaptive Gaussian kernel variance enables the model to learn dynamically with a low-rank approximation, leading to improved convergence efficiency with comparable accuracy. The performance of SACC-Net is extensively evaluated on four public datasets: UCF-QNRF, UCF CC 50, NWPU, and ShanghaiTech A-B. Experimental results demonstrate that SACC-Net outperforms all state-of-the-art methods, validating its effectiveness in achieving superior crowd counting accuracy.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes