CVMay 24, 2024

UNION: Unsupervised 3D Object Detection using Object Appearance-based Pseudo-Classes

arXiv:2405.15688v323 citationsh-index: 51Has CodeNIPS
Originality Highly original
AI Analysis

This addresses the need for efficient unsupervised 3D object detection in autonomous driving by reducing computational costs compared to prior methods.

The paper tackles the problem of unsupervised 3D object detection by proposing UNION, which uses object appearance-based pseudo-classes to jointly obtain static and dynamic objects in a single training round, more than doubling the average precision to 39.5 on the nuScenes dataset.

Unsupervised 3D object detection methods have emerged to leverage vast amounts of data without requiring manual labels for training. Recent approaches rely on dynamic objects for learning to detect mobile objects but penalize the detections of static instances during training. Multiple rounds of self-training are used to add detected static instances to the set of training targets; this procedure to improve performance is computationally expensive. To address this, we propose the method UNION. We use spatial clustering and self-supervised scene flow to obtain a set of static and dynamic object proposals from LiDAR. Subsequently, object proposals' visual appearances are encoded to distinguish static objects in the foreground and background by selecting static instances that are visually similar to dynamic objects. As a result, static and dynamic mobile objects are obtained together, and existing detectors can be trained with a single training. In addition, we extend 3D object discovery to detection by using object appearance-based cluster labels as pseudo-class labels for training object classification. We conduct extensive experiments on the nuScenes dataset and increase the state-of-the-art performance for unsupervised 3D object discovery, i.e. UNION more than doubles the average precision to 39.5. The code is available at github.com/TedLentsch/UNION.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes