CVMar 23, 2023

Collaboration Helps Camera Overtake LiDAR in 3D Detection

arXiv:2303.13560v1131 citationsh-index: 50Has Code
Originality Highly original
AI Analysis

This work addresses the problem of accurate 3D object localization for autonomous systems using cost-effective camera setups, offering a novel collaborative approach that could potentially surpass LiDAR in some scenarios.

The paper tackles the challenge of precise depth estimation in camera-only 3D detection by introducing multi-agent collaborations, where agents share complementary information to disambiguate depth and handle occlusions, resulting in improvements of up to 44.21% over previous state-of-the-art methods on benchmark datasets.

Camera-only 3D detection provides an economical solution with a simple configuration for localizing objects in 3D space compared to LiDAR-based detection systems. However, a major challenge lies in precise depth estimation due to the lack of direct 3D measurements in the input. Many previous methods attempt to improve depth estimation through network designs, e.g., deformable layers and larger receptive fields. This work proposes an orthogonal direction, improving the camera-only 3D detection by introducing multi-agent collaborations. Our proposed collaborative camera-only 3D detection (CoCa3D) enables agents to share complementary information with each other through communication. Meanwhile, we optimize communication efficiency by selecting the most informative cues. The shared messages from multiple viewpoints disambiguate the single-agent estimated depth and complement the occluded and long-range regions in the single-agent view. We evaluate CoCa3D in one real-world dataset and two new simulation datasets. Results show that CoCa3D improves previous SOTA performances by 44.21% on DAIR-V2X, 30.60% on OPV2V+, 12.59% on CoPerception-UAVs+ for AP@70. Our preliminary results show a potential that with sufficient collaboration, the camera might overtake LiDAR in some practical scenarios. We released the dataset and code at https://siheng-chen.github.io/dataset/CoPerception+ and https://github.com/MediaBrain-SJTU/CoCa3D.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes