ROAILGOct 16, 2018

Integrating kinematics and environment context into deep inverse reinforcement learning for predicting off-road vehicle trajectories

arXiv:1810.07225v146 citations
Originality Highly original
AI Analysis

This work addresses motion prediction for robotics applications like autonomous navigation, offering a novel integration approach that improves trajectory forecasting in off-road environments.

The paper tackles the problem of predicting off-road vehicle trajectories by integrating kinematics and environmental context into an inverse reinforcement learning framework, resulting in accurate predictions that capture complex behaviors like multi-modal distributions at intersections, validated with over 30 km of driving data.

Predicting the motion of a mobile agent from a third-person perspective is an important component for many robotics applications, such as autonomous navigation and tracking. With accurate motion prediction of other agents, robots can plan for more intelligent behaviors to achieve specified objectives, instead of acting in a purely reactive way. Previous work addresses motion prediction by either only filtering kinematics, or using hand-designed and learned representations of the environment. Instead of separating kinematic and environmental context, we propose a novel approach to integrate both into an inverse reinforcement learning (IRL) framework for trajectory prediction. Instead of exponentially increasing the state-space complexity with kinematics, we propose a two-stage neural network architecture that considers motion and environment together to recover the reward function. The first-stage network learns feature representations of the environment using low-level LiDAR statistics and the second-stage network combines those learned features with kinematics data. We collected over 30 km of off-road driving data and validated experimentally that our method can effectively extract useful environmental and kinematic features. We generate accurate predictions of the distribution of future trajectories of the vehicle, encoding complex behaviors such as multi-modal distributions at road intersections, and even show different predictions at the same intersection depending on the vehicle's speed.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes