CVJun 7, 2022

Self-supervised Domain Adaptation in Crowd Counting

arXiv:2206.03431v217 citationsh-index: 31
AI Analysis

This work addresses the resource-intensive annotation challenge in crowd counting for computer vision applications, representing an incremental advance in domain adaptation techniques.

The paper tackles the problem of reducing manual annotation needs in crowd counting by introducing a self-supervised domain adaptation method that uses labeled source data and unlabeled target data, achieving generalized improvements over state-of-the-art methods on datasets like Shanghaitech, UCF_CC_50, and UCF-QNRF.

Self-training crowd counting has not been attentively explored though it is one of the important challenges in computer vision. In practice, the fully supervised methods usually require an intensive resource of manual annotation. In order to address this challenge, this work introduces a new approach to utilize existing datasets with ground truth to produce more robust predictions on unlabeled datasets, named domain adaptation, in crowd counting. While the network is trained with labeled data, samples without labels from the target domain are also added to the training process. In this process, the entropy map is computed and minimized in addition to the adversarial training process designed in parallel. Experiments on Shanghaitech, UCF_CC_50, and UCF-QNRF datasets prove a more generalized improvement of our method over the other state-of-the-arts in the cross-domain setting.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes