LG AIMay 16, 2022

An Empirical Investigation of Representation Learning for Imitation

Xin Chen, Sam Toyer, Cody Wild, Scott Emmons, Ian Fischer, Kuang-Huei Lee, Neel Alex, Steven H Wang, Ping Luo, Stuart Russell, Pieter Abbeel, Rohin Shah

Berkeley

arXiv:2205.07886v115.128 citationsh-index: 164Has Code

Originality Synthesis-oriented

AI Analysis

This addresses the problem of expensive demonstration collection for imitation learning practitioners, but the findings suggest the approach may be incremental or less effective than expected.

The paper investigated whether representation learning reduces the need for large expert demonstration sets in imitation learning, finding that existing image-based representation learning algorithms provided limited benefit compared to a well-tuned baseline with image augmentations.

Imitation learning often needs a large demonstration set in order to handle the full range of situations that an agent might find itself in during deployment. However, collecting expert demonstrations can be expensive. Recent work in vision, reinforcement learning, and NLP has shown that auxiliary representation learning objectives can reduce the need for large amounts of expensive, task-specific data. Our Empirical Investigation of Representation Learning for Imitation (EIRLI) investigates whether similar benefits apply to imitation learning. We propose a modular framework for constructing representation learning algorithms, then use our framework to evaluate the utility of representation learning for imitation across several environment suites. In the settings we evaluate, we find that existing algorithms for image-based representation learning provide limited value relative to a well-tuned baseline with image augmentations. To explain this result, we investigate differences between imitation learning and other settings where representation learning has provided significant benefit, such as image classification. Finally, we release a well-documented codebase which both replicates our findings and provides a modular framework for creating new representation learning algorithms out of reusable components.

View on arXiv PDF Code

Similar