CVDec 6, 2017

Joint 3D Proposal Generation and Object Detection from View Aggregation

arXiv:1712.02294v41575 citationsHas Code
Originality Incremental advance
AI Analysis

This addresses the problem of accurate and efficient 3D perception for autonomous vehicles, representing an incremental improvement over existing methods.

The authors tackled 3D object detection for autonomous driving by proposing AVOD, a network that aggregates LIDAR and RGB data to generate 3D proposals and detect objects, achieving state-of-the-art results on the KITTI benchmark with real-time performance.

We present AVOD, an Aggregate View Object Detection network for autonomous driving scenarios. The proposed neural network architecture uses LIDAR point clouds and RGB images to generate features that are shared by two subnetworks: a region proposal network (RPN) and a second stage detector network. The proposed RPN uses a novel architecture capable of performing multimodal feature fusion on high resolution feature maps to generate reliable 3D object proposals for multiple object classes in road scenes. Using these proposals, the second stage detection network performs accurate oriented 3D bounding box regression and category classification to predict the extents, orientation, and classification of objects in 3D space. Our proposed architecture is shown to produce state of the art results on the KITTI 3D object detection benchmark while running in real time with a low memory footprint, making it a suitable candidate for deployment on autonomous vehicles. Code is at: https://github.com/kujason/avod

Code Implementations4 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes