LG AI MLApr 19, 2017

Simultaneous Policy Learning and Latent State Inference for Imitating Driver Behavior

arXiv:1704.05566v17.736 citationsHas Code

Originality Incremental advance

AI Analysis

This work addresses driver behavior imitation for autonomous systems, but it is incremental as it builds on existing methods for latent state inference.

The paper tackled the problem of imitating driver behavior by learning models that account for unobserved variables, resulting in policies that more effectively replicate driver behavior compared to baselines and show strong influence from latent variable settings.

In this work, we propose a method for learning driver models that account for variables that cannot be observed directly. When trained on a synthetic dataset, our models are able to learn encodings for vehicle trajectories that distinguish between four distinct classes of driver behavior. Such encodings are learned without any knowledge of the number of driver classes or any objective that directly requires the models to learn encodings for each class. We show that driving policies trained with knowledge of latent variables are more effective than baseline methods at imitating the driver behavior that they are trained to replicate. Furthermore, we demonstrate that the actions chosen by our policy are heavily influenced by the latent variable settings that are provided to them.

View on arXiv PDF Code

Similar