CVMar 11, 2025

Learning to Detect Objects from Multi-Agent LiDAR Scans without Manual Labels

arXiv:2503.08421v27 citationsh-index: 34Has CodeCVPR
Originality Incremental advance
AI Analysis

This addresses the need for efficient offline annotation in autonomous driving by improving unsupervised detection, though it appears incremental as it builds on existing clustering-based approaches.

The paper tackles the problem of low-quality pseudo-labels in unsupervised 3D object detection by introducing DOtA, a method that uses multi-agent LiDAR scans without manual labels, and shows it outperforms state-of-the-art methods on datasets like V2V4Real and OPV2V.

Unsupervised 3D object detection serves as an important solution for offline 3D object annotation. However, due to the data sparsity and limited views, the clustering-based label fitting in unsupervised object detection often generates low-quality pseudo-labels. Multi-agent collaborative dataset, which involves the sharing of complementary observations among agents, holds the potential to break through this bottleneck. In this paper, we introduce a novel unsupervised method that learns to Detect Objects from Multi-Agent LiDAR scans, termed DOtA, without using labels from external. DOtA first uses the internally shared ego-pose and ego-shape of collaborative agents to initialize the detector, leveraging the generalization performance of neural networks to infer preliminary labels. Subsequently,DOtA uses the complementary observations between agents to perform multi-scale encoding on preliminary labels, then decodes high-quality and low-quality labels. These labels are further used as prompts to guide a correct feature learning process, thereby enhancing the performance of the unsupervised object detection task. Extensive experiments on the V2V4Real and OPV2V datasets show that our DOtA outperforms state-of-the-art unsupervised 3D object detection methods. Additionally, we also validate the effectiveness of the DOtA labels under various collaborative perception frameworks.The code is available at https://github.com/xmuqimingxia/DOtA.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes