CVLGMar 2, 2021

Pseudo-labeling for Scalable 3D Object Detection

arXiv:2103.02093v146 citations
Originality Incremental advance
AI Analysis

This addresses the high cost of data annotation for autonomous vehicle perception, offering a scalable solution for deployment across diverse environments, though it is incremental as it builds on existing pseudo-labeling techniques.

The paper tackles the problem of costly labeled data for 3D object detection in autonomous vehicles by using pseudo-labeling on unlabeled data, achieving state-of-the-art accuracy with student models that outperform supervised models trained on 3-10 times more labeled examples and generalize better to new domains.

To safely deploy autonomous vehicles, onboard perception systems must work reliably at high accuracy across a diverse set of environments and geographies. One of the most common techniques to improve the efficacy of such systems in new domains involves collecting large labeled datasets, but such datasets can be extremely costly to obtain, especially if each new deployment geography requires additional data with expensive 3D bounding box annotations. We demonstrate that pseudo-labeling for 3D object detection is an effective way to exploit less expensive and more widely available unlabeled data, and can lead to performance gains across various architectures, data augmentation strategies, and sizes of the labeled dataset. Overall, we show that better teacher models lead to better student models, and that we can distill expensive teachers into efficient, simple students. Specifically, we demonstrate that pseudo-label-trained student models can outperform supervised models trained on 3-10 times the amount of labeled examples. Using PointPillars [24], a two-year-old architecture, as our student model, we are able to achieve state of the art accuracy simply by leveraging large quantities of pseudo-labeled data. Lastly, we show that these student models generalize better than supervised models to a new domain in which we only have unlabeled data, making pseudo-label training an effective form of unsupervised domain adaptation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes