LGCVIVMLApr 28, 2020

Image Augmentation Is All You Need: Regularizing Deep Reinforcement Learning from Pixels

arXiv:2004.13649v4928 citations
Originality Highly original
AI Analysis

This addresses the challenge of robust pixel-based learning for reinforcement learning practitioners, offering a simple, broadly applicable augmentation technique.

The paper tackles the problem of training deep reinforcement learning agents directly from pixels without auxiliary losses or pre-training, achieving state-of-the-art performance on the DeepMind control suite by surpassing model-based and contrastive learning methods.

We propose a simple data augmentation technique that can be applied to standard model-free reinforcement learning algorithms, enabling robust learning directly from pixels without the need for auxiliary losses or pre-training. The approach leverages input perturbations commonly used in computer vision tasks to regularize the value function. Existing model-free approaches, such as Soft Actor-Critic (SAC), are not able to train deep networks effectively from image pixels. However, the addition of our augmentation method dramatically improves SAC's performance, enabling it to reach state-of-the-art performance on the DeepMind control suite, surpassing model-based (Dreamer, PlaNet, and SLAC) methods and recently proposed contrastive learning (CURL). Our approach can be combined with any model-free reinforcement learning algorithm, requiring only minor modifications. An implementation can be found at https://sites.google.com/view/data-regularized-q.

Code Implementations4 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes