CV LG ROJun 21, 2019

Rules of the Road: Predicting Driving Behavior with a Convolutional Model of Semantic Interactions

arXiv:1906.08945v129.5285 citations

Originality Synthesis-oriented

AI Analysis

This addresses the problem of long-term prediction in self-driving systems for industry applications, but it is incremental as it builds on existing methods with new data and representation.

The paper tackles predicting future states of entities in driving scenarios by leveraging high-level semantic information from 3D perception and maps, using a convolutional model to encode interactions, and shows it can effectively learn driving behavior fundamentals.

We focus on the problem of predicting future states of entities in complex, real-world driving scenarios. Previous research has used low-level signals to predict short time horizons, and has not addressed how to leverage key assets relied upon heavily by industry self-driving systems: (1) large 3D perception efforts which provide highly accurate 3D states of agents with rich attributes, and (2) detailed and accurate semantic maps of the environment (lanes, traffic lights, crosswalks, etc). We present a unified representation which encodes such high-level semantic information in a spatial grid, allowing the use of deep convolutional models to fuse complex scene context. This enables learning entity-entity and entity-environment interactions with simple, feed-forward computations in each timestep within an overall temporal model of an agent's behavior. We propose different ways of modelling the future as a distribution over future states using standard supervised learning. We introduce a novel dataset providing industry-grade rich perception and semantic inputs, and empirically show we can effectively learn fundamentals of driving behavior.

View on arXiv PDF

Similar