CVFeb 15, 2023

Efficient Teacher: Semi-Supervised Object Detection for YOLOv5

arXiv:2302.07577v350 citationsh-index: 6Has Code
Originality Incremental advance
AI Analysis

This work improves semi-supervised object detection for one-stage anchor-based detectors, offering a scalable solution for applications with limited labeled data, though it is incremental as it builds on existing YOLOv5 and semi-supervised techniques.

The paper tackles the problem of semi-supervised object detection for one-stage anchor-based detectors like YOLOv5, which suffer from inconsistency issues due to low-quality pseudo labels, and proposes the Efficient Teacher framework to address this, achieving state-of-the-art results on VOC, COCO-standard, and COCO-additional datasets with fewer FLOPs than previous methods.

Semi-Supervised Object Detection (SSOD) has been successful in improving the performance of both R-CNN series and anchor-free detectors. However, one-stage anchor-based detectors lack the structure to generate high-quality or flexible pseudo labels, leading to serious inconsistency problems in SSOD. In this paper, we propose the Efficient Teacher framework for scalable and effective one-stage anchor-based SSOD training, consisting of Dense Detector, Pseudo Label Assigner, and Epoch Adaptor. Dense Detector is a baseline model that extends RetinaNet with dense sampling techniques inspired by YOLOv5. The Efficient Teacher framework introduces a novel pseudo label assignment mechanism, named Pseudo Label Assigner, which makes more refined use of pseudo labels from Dense Detector. Epoch Adaptor is a method that enables a stable and efficient end-to-end semi-supervised training schedule for Dense Detector. The Pseudo Label Assigner prevents the occurrence of bias caused by a large number of low-quality pseudo labels that may interfere with the Dense Detector during the student-teacher mutual learning mechanism, and the Epoch Adaptor utilizes domain and distribution adaptation to allow Dense Detector to learn globally distributed consistent features, making the training independent of the proportion of labeled data. Our experiments show that the Efficient Teacher framework achieves state-of-the-art results on VOC, COCO-standard, and COCO-additional using fewer FLOPs than previous methods. To the best of our knowledge, this is the first attempt to apply Semi-Supervised Object Detection to YOLOv5.Code is available: https://github.com/AlibabaResearch/efficientteacher

Code Implementations4 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes