CVLGFeb 23, 2024

Semi-supervised Counting via Pixel-by-pixel Density Distribution Modelling

arXiv:2402.15297v111 citationsh-index: 25Has CodeIEEE Trans Pattern Anal Mach Intell
Originality Incremental advance
AI Analysis

This addresses the problem of reducing labeling effort for crowd counting, which is incremental as it builds on existing semi-supervised methods with specific enhancements.

The paper tackles semi-supervised crowd counting by modeling pixel-wise density as a probability distribution, achieving clear outperformance over competitors by a large margin across four datasets under various labeled ratio settings.

This paper focuses on semi-supervised crowd counting, where only a small portion of the training data are labeled. We formulate the pixel-wise density value to regress as a probability distribution, instead of a single deterministic value. On this basis, we propose a semi-supervised crowd-counting model. Firstly, we design a pixel-wise distribution matching loss to measure the differences in the pixel-wise density distributions between the prediction and the ground truth; Secondly, we enhance the transformer decoder by using density tokens to specialize the forwards of decoders w.r.t. different density intervals; Thirdly, we design the interleaving consistency self-supervised learning mechanism to learn from unlabeled data efficiently. Extensive experiments on four datasets are performed to show that our method clearly outperforms the competitors by a large margin under various labeled ratio settings. Code will be released at https://github.com/LoraLinH/Semi-supervised-Counting-via-Pixel-by-pixel-Density-Distribution-Modelling.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes