CVROJun 28, 2022

Pedestrian 3D Bounding Box Prediction

arXiv:2206.14195v113 citationsh-index: 45Has Code
Originality Incremental advance
AI Analysis

This addresses the need for longer-horizon motion prediction in autonomous vehicles, though it is incremental by focusing on a new granularity level between existing coarse and fine predictions.

The paper tackles the problem of predicting pedestrians' 3D bounding boxes for autonomous driving safety, presenting a model that achieves effectiveness on synthetic and real-world datasets.

Safety is still the main issue of autonomous driving, and in order to be globally deployed, they need to predict pedestrians' motions sufficiently in advance. While there is a lot of research on coarse-grained (human center prediction) and fine-grained predictions (human body keypoints prediction), we focus on 3D bounding boxes, which are reasonable estimates of humans without modeling complex motion details for autonomous vehicles. This gives the flexibility to predict in longer horizons in real-world settings. We suggest this new problem and present a simple yet effective model for pedestrians' 3D bounding box prediction. This method follows an encoder-decoder architecture based on recurrent neural networks, and our experiments show its effectiveness in both the synthetic (JTA) and real-world (NuScenes) datasets. The learned representation has useful information to enhance the performance of other tasks, such as action anticipation. Our code is available online: https://github.com/vita-epfl/bounding-box-prediction

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes