CV ROApr 21, 2021

FIERY: Future Instance Prediction in Bird's-Eye View from Surround Monocular Cameras

Anthony Hu, Zak Murez, Nikhil Mohan, Sofía Dudas, Jeffrey Hawke, Vijay Badrinarayanan, Roberto Cipolla, Alex Kendall

arXiv:2104.10490v334.1331 citationsh-index: 82Has Code

Originality Incremental advance

AI Analysis

This addresses the need for safe navigation by predicting multimodal future trajectories without relying on HD maps, though it is incremental as it builds on existing prediction methods.

The paper tackles the problem of predicting future instance segmentation and motion of dynamic agents in autonomous driving from monocular camera inputs, achieving state-of-the-art performance on NuScenes and Lyft datasets.

Driving requires interacting with road agents and predicting their future behaviour in order to navigate safely. We present FIERY: a probabilistic future prediction model in bird's-eye view from monocular cameras. Our model predicts future instance segmentation and motion of dynamic agents that can be transformed into non-parametric future trajectories. Our approach combines the perception, sensor fusion and prediction components of a traditional autonomous driving stack by estimating bird's-eye-view prediction directly from surround RGB monocular camera inputs. FIERY learns to model the inherent stochastic nature of the future solely from camera driving data in an end-to-end manner, without relying on HD maps, and predicts multimodal future trajectories. We show that our model outperforms previous prediction baselines on the NuScenes and Lyft datasets. The code and trained models are available at https://github.com/wayveai/fiery.

View on arXiv PDF Code

Similar