CVAILGMar 10, 2022

Back to Reality: Weakly-supervised 3D Object Detection with Shape-guided Label Enhancement

Tsinghua
arXiv:2203.05238v331 citationsh-index: 97Has Code
Originality Incremental advance
AI Analysis

This work addresses the high labeling cost for 3D object detection in indoor scenes, offering a weakly-supervised solution that is incremental in improving efficiency.

The paper tackles the problem of 3D object detection with weak supervision by proposing a method that uses synthetic 3D shapes to enhance position-level annotations, achieving comparable performance to fully-supervised approaches on ScanNet with less than 5% of the labeling effort.

In this paper, we propose a weakly-supervised approach for 3D object detection, which makes it possible to train a strong 3D detector with position-level annotations (i.e. annotations of object centers). In order to remedy the information loss from box annotations to centers, our method, namely Back to Reality (BR), makes use of synthetic 3D shapes to convert the weak labels into fully-annotated virtual scenes as stronger supervision, and in turn utilizes the perfect virtual labels to complement and refine the real labels. Specifically, we first assemble 3D shapes into physically reasonable virtual scenes according to the coarse scene layout extracted from position-level annotations. Then we go back to reality by applying a virtual-to-real domain adaptation method, which refine the weak labels and additionally supervise the training of detector with the virtual scenes. Furthermore, we propose a more challenging benckmark for indoor 3D object detection with more diversity in object sizes to better show the potential of BR. With less than 5% of the labeling labor, we achieve comparable detection performance with some popular fully-supervised approaches on the widely used ScanNet dataset. Code is available at: https://github.com/wyf-ACCEPT/BackToReality

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes