LGMLApr 1, 2019

Generative predecessor models for sample-efficient imitation learning

arXiv:1904.01139v131 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of reducing the need for extensive expert data in imitation learning, particularly for robotics, though it appears incremental as it builds on existing methods.

The paper tackles the problem of sample-efficient imitation learning by proposing Generative Predecessor Models for Imitation Learning (GPRIL), which uses generative models to match state-action distributions from expert demonstrations, resulting in robust policies with fewer demonstrations and outperforming a state-of-the-art method on simulated and real robot tasks.

We propose Generative Predecessor Models for Imitation Learning (GPRIL), a novel imitation learning algorithm that matches the state-action distribution to the distribution observed in expert demonstrations, using generative models to reason probabilistically about alternative histories of demonstrated states. We show that this approach allows an agent to learn robust policies using only a small number of expert demonstrations and self-supervised interactions with the environment. We derive this approach from first principles and compare it empirically to a state-of-the-art imitation learning method, showing that it outperforms or matches its performance on two simulated robot manipulation tasks and demonstrate significantly higher sample efficiency by applying the algorithm on a real robot.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes