LGJun 21, 2017

Generating Long-term Trajectories Using Deep Hierarchical Networks

arXiv:1706.07138v1113 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of learning cohesive long-term behavior from expert demonstrations in domains like sports, where conventional shallow models fail, though it is incremental as it builds on hierarchical policy concepts.

The paper tackles the problem of modeling long-term spatiotemporal trajectories in high-dimensional state spaces, such as sports strategies, by proposing a hierarchical neural network that reasons about both long-term and short-term goals. It demonstrates that this approach generates significantly more realistic basketball trajectories compared to non-hierarchical baselines, as evaluated by professional sports analysts.

We study the problem of modeling spatiotemporal trajectories over long time horizons using expert demonstrations. For instance, in sports, agents often choose action sequences with long-term goals in mind, such as achieving a certain strategic position. Conventional policy learning approaches, such as those based on Markov decision processes, generally fail at learning cohesive long-term behavior in such high-dimensional state spaces, and are only effective when myopic modeling lead to the desired behavior. The key difficulty is that conventional approaches are "shallow" models that only learn a single state-action policy. We instead propose a hierarchical policy class that automatically reasons about both long-term and short-term goals, which we instantiate as a hierarchical neural network. We showcase our approach in a case study on learning to imitate demonstrated basketball trajectories, and show that it generates significantly more realistic trajectories compared to non-hierarchical baselines as judged by professional sports analysts.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes