LGMay 8, 2021

RAIL: A modular framework for Reinforcement-learning-based Adversarial Imitation Learning

Eddy Hudson, Garrett Warnell, Peter Stone

arXiv:2105.03756v14.45 citations

Originality Incremental advance

AI Analysis

This work addresses the need for better understanding and performance in imitation learning for robotics and AI applications, though it appears incremental as it builds on existing AIL methods.

The authors tackled the problem of unclear design decisions in Adversarial Imitation Learning (AIL) by introducing RAIL, a modular framework that generalizes existing AIL approaches, and developed SAIfO, a new algorithm that outperforms contemporaneous methods on locomotion tasks from OpenAI Gym.

While Adversarial Imitation Learning (AIL) algorithms have recently led to state-of-the-art results on various imitation learning benchmarks, it is unclear as to what impact various design decisions have on performance. To this end, we present here an organizing, modular framework called Reinforcement-learning-based Adversarial Imitation Learning (RAIL) that encompasses and generalizes a popular subclass of existing AIL approaches. Using the view espoused by RAIL, we create two new IfO (Imitation from Observation) algorithms, which we term SAIfO: SAC-based Adversarial Imitation from Observation and SILEM (Skeletal Feature Compensation for Imitation Learning with Embodiment Mismatch). We go into greater depth about SILEM in a separate technical report. In this paper, we focus on SAIfO, evaluating it on a suite of locomotion tasks from OpenAI Gym, and showing that it outperforms contemporaneous RAIL algorithms that perform IfO.

View on arXiv PDF

Similar