CVROSep 29, 2016

Reconstructing Vechicles from a Single Image: Shape Priors for Road Scene Understanding

arXiv:1609.09468v144 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of 3D scene understanding in autonomous driving, though it is incremental as it builds on prior work with shape priors and CNNs.

The paper tackles the problem of reconstructing 3D vehicle shapes from a single RGB image for autonomous driving, using learned shape priors and a cascaded CNN for keypoint localization, and demonstrates superior performance compared to existing monocular and stereo methods on benchmarks.

We present an approach for reconstructing vehicles from a single (RGB) image, in the context of autonomous driving. Though the problem appears to be ill-posed, we demonstrate that prior knowledge about how 3D shapes of vehicles project to an image can be used to reason about the reverse process, i.e., how shapes (back-)project from 2D to 3D. We encode this knowledge in \emph{shape priors}, which are learnt over a small keypoint-annotated dataset. We then formulate a shape-aware adjustment problem that uses the learnt shape priors to recover the 3D pose and shape of a query object from an image. For shape representation and inference, we leverage recent successes of Convolutional Neural Networks (CNNs) for the task of object and keypoint localization, and train a novel cascaded fully-convolutional architecture to localize vehicle \emph{keypoints} in images. The shape-aware adjustment then robustly recovers shape (3D locations of the detected keypoints) while simultaneously filling in occluded keypoints. To tackle estimation errors incurred due to erroneously detected keypoints, we use an Iteratively Re-weighted Least Squares (IRLS) scheme for robust optimization, and as a by-product characterize noise models for each predicted keypoint. We evaluate our approach on autonomous driving benchmarks, and present superior results to existing monocular, as well as stereo approaches.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes