LGAIJun 26, 2021

Intrinsically Motivated Self-supervised Learning in Reinforcement Learning

arXiv:2106.13970v25 citations
Originality Incremental advance
AI Analysis

This addresses sample efficiency and generalization issues in vision-based robotics tasks, representing an incremental improvement over existing methods.

The paper tackles the problem of underutilizing information in self-supervised auxiliary tasks in vision-based reinforcement learning by proposing IM-SSR, which uses self-supervised loss as an intrinsic reward, leading to salient improvements in sample efficiency and generalization, especially in sparse reward scenarios.

In vision-based reinforcement learning (RL) tasks, it is prevalent to assign auxiliary tasks with a surrogate self-supervised loss so as to obtain more semantic representations and improve sample efficiency. However, abundant information in self-supervised auxiliary tasks has been disregarded, since the representation learning part and the decision-making part are separated. To sufficiently utilize information in auxiliary tasks, we present a simple yet effective idea to employ self-supervised loss as an intrinsic reward, called Intrinsically Motivated Self-Supervised learning in Reinforcement learning (IM-SSR). We formally show that the self-supervised loss can be decomposed as exploration for novel states and robustness improvement from nuisance elimination. IM-SSR can be effortlessly plugged into any reinforcement learning with self-supervised auxiliary objectives with nearly no additional cost. Combined with IM-SSR, the previous underlying algorithms achieve salient improvements on both sample efficiency and generalization in various vision-based robotics tasks from the DeepMind Control Suite, especially when the reward signal is sparse.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes