CVJan 10, 2024

Dual-Perspective Knowledge Enrichment for Semi-Supervised 3D Object Detection

arXiv:2401.05011v112 citationsh-index: 10AAAI
Originality Incremental advance
AI Analysis

This work addresses the challenge of limited labeled data in 3D object detection, offering a solution for applications like robotics and autonomous systems, though it is incremental as it builds on prior semi-supervised methods.

The paper tackles the problem of reducing annotation costs for 3D object detection in cluttered indoor scenes by proposing a semi-supervised method called DPKE, which enriches knowledge from data and feature perspectives, achieving superior performance over state-of-the-art approaches on benchmark datasets under various label ratios.

Semi-supervised 3D object detection is a promising yet under-explored direction to reduce data annotation costs, especially for cluttered indoor scenes. A few prior works, such as SESS and 3DIoUMatch, attempt to solve this task by utilizing a teacher model to generate pseudo-labels for unlabeled samples. However, the availability of unlabeled samples in the 3D domain is relatively limited compared to its 2D counterpart due to the greater effort required to collect 3D data. Moreover, the loose consistency regularization in SESS and restricted pseudo-label selection strategy in 3DIoUMatch lead to either low-quality supervision or a limited amount of pseudo labels. To address these issues, we present a novel Dual-Perspective Knowledge Enrichment approach named DPKE for semi-supervised 3D object detection. Our DPKE enriches the knowledge of limited training data, particularly unlabeled data, from two perspectives: data-perspective and feature-perspective. Specifically, from the data-perspective, we propose a class-probabilistic data augmentation method that augments the input data with additional instances based on the varying distribution of class probabilities. Our DPKE achieves feature-perspective knowledge enrichment by designing a geometry-aware feature matching method that regularizes feature-level similarity between object proposals from the student and teacher models. Extensive experiments on the two benchmark datasets demonstrate that our DPKE achieves superior performance over existing state-of-the-art approaches under various label ratio conditions. The source code will be made available to the public.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes