CVLGROJun 25, 2020

One Thousand and One Hours: Self-driving Motion Prediction Dataset

arXiv:2006.14480v2467 citations
Originality Synthesis-oriented
AI Analysis

This dataset addresses the need for large-scale, detailed data to advance machine learning tasks like motion forecasting and planning in self-driving systems, though it is incremental as it builds on existing dataset efforts.

The authors introduced the largest self-driving dataset for motion prediction, containing over 1,000 hours of data from 20 autonomous vehicles, and showed that using such a large dataset dramatically improves performance for key self-driving problems.

Motivated by the impact of large-scale datasets on ML systems we present the largest self-driving dataset for motion prediction to date, containing over 1,000 hours of data. This was collected by a fleet of 20 autonomous vehicles along a fixed route in Palo Alto, California, over a four-month period. It consists of 170,000 scenes, where each scene is 25 seconds long and captures the perception output of the self-driving system, which encodes the precise positions and motions of nearby vehicles, cyclists, and pedestrians over time. On top of this, the dataset contains a high-definition semantic map with 15,242 labelled elements and a high-definition aerial view over the area. We show that using a dataset of this size dramatically improves performance for key self-driving problems. Combined with the provided software kit, this collection forms the largest and most detailed dataset to date for the development of self-driving machine learning tasks, such as motion forecasting, motion planning and simulation. The full dataset is available at http://level5.lyft.com/.

Code Implementations3 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes