CV LG IVOct 19, 2022

Non-iterative optimization of pseudo-labeling thresholds for training object detection models from multiple datasets

Yuki Tanaka, Shuhei M. Yoshida, Makoto Terao

arXiv:2210.10221v13.72 citationsh-index: 6

Originality Incremental advance

AI Analysis

This work addresses a time-consuming bottleneck in pseudo-labeling for object detection, offering an incremental improvement for researchers and practitioners using multi-dataset training.

The paper tackles the problem of optimizing pseudo-labeling thresholds for training object detection models from multiple datasets with partial annotations, proposing a non-iterative method that achieves mAP comparable to grid search on COCO and VOC datasets.

We propose a non-iterative method to optimize pseudo-labeling thresholds for learning object detection from a collection of low-cost datasets, each of which is annotated for only a subset of all the object classes. A popular approach to this problem is first to train teacher models and then to use their confident predictions as pseudo ground-truth labels when training a student model. To obtain the best result, however, thresholds for prediction confidence must be adjusted. This process typically involves iterative search and repeated training of student models and is time-consuming. Therefore, we develop a method to optimize the thresholds without iterative optimization by maximizing the $F_β$-score on a validation dataset, which measures the quality of pseudo labels and can be measured without training a student model. We experimentally demonstrate that our proposed method achieves an mAP comparable to that of grid search on the COCO and VOC datasets.

View on arXiv PDF

Similar