CVMar 27, 2020

Weakly Supervised Dataset Collection for Robust Person Detection

Munetaka Minoguchi, Ken Okayama, Yutaka Satoh, Hirokatsu Kataoka

arXiv:2003.12263v22.31 citationsh-index: 23Has Code

Originality Synthesis-oriented

AI Analysis

This provides a larger dataset for the person detection research community, though it is incremental as it builds on existing methods with new data.

The paper tackles the problem of limited person detection datasets by introducing a weakly supervised dataset with 8.7 million images, which improves pre-trained model accuracy by 13.38% and 6.38% compared to fully supervised datasets like ImageNet and EuroCity Persons.

To construct an algorithm that can provide robust person detection, we present a dataset with over 8 million images that was produced in a weakly supervised manner. Through labor-intensive human annotation, the person detection research community has produced relatively small datasets containing on the order of 100,000 images, such as the EuroCity Persons dataset, which includes 240,000 bounding boxes. Therefore, we have collected 8.7 million images of persons based on a two-step collection process, namely person detection with an existing detector and data refinement for false positive suppression. According to the experimental results, the Weakly Supervised Person Dataset (WSPD) is simple yet effective for person detection pre-training. In the context of pre-trained person detection algorithms, our WSPD pre-trained model has 13.38 and 6.38% better accuracy than the same model trained on the fully supervised ImageNet and EuroCity Persons datasets, respectively, when verified with the Caltech Pedestrian.

View on arXiv PDF Code

Similar