CVROMar 21, 2022

Stereo Neural Vernier Caliper

arXiv:2203.11018v26 citationsh-index: 36Has Code
Originality Incremental advance
AI Analysis

This work improves 3D object detection for autonomous driving by offering a more flexible and functional instance-level model, though it is incremental as it builds upon existing scene-centric methods.

The paper tackles the problem of stereo 3D object detection by proposing an instance-level framework to address the limitations of scene-centric approaches, achieving state-of-the-art performance on the KITTI benchmark.

We propose a new object-centric framework for learning-based stereo 3D object detection. Previous studies build scene-centric representations that do not consider the significant variation among outdoor instances and thus lack the flexibility and functionalities that an instance-level model can offer. We build such an instance-level model by formulating and tackling a local update problem, i.e., how to predict a refined update given an initial 3D cuboid guess. We demonstrate how solving this problem can complement scene-centric approaches in (i) building a coarse-to-fine multi-resolution system, (ii) performing model-agnostic object location refinement, and (iii) conducting stereo 3D tracking-by-detection. Extensive experiments demonstrate the effectiveness of our approach, which achieves state-of-the-art performance on the KITTI benchmark. Code and pre-trained models are available at https://github.com/Nicholasli1995/SNVC.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes