LG AIJun 26, 2021

Intrinsically Motivated Self-supervised Learning in Reinforcement Learning

Yue Zhao, Chenzhuang Du, Hang Zhao, Tiejun Li

arXiv:2106.13970v23.15 citations

Originality Incremental advance

AI Analysis

This addresses sample efficiency and generalization issues in vision-based robotics tasks, representing an incremental improvement over existing methods.

The paper tackles the problem of underutilizing information in self-supervised auxiliary tasks in vision-based reinforcement learning by proposing IM-SSR, which uses self-supervised loss as an intrinsic reward, leading to salient improvements in sample efficiency and generalization, especially in sparse reward scenarios.

In vision-based reinforcement learning (RL) tasks, it is prevalent to assign auxiliary tasks with a surrogate self-supervised loss so as to obtain more semantic representations and improve sample efficiency. However, abundant information in self-supervised auxiliary tasks has been disregarded, since the representation learning part and the decision-making part are separated. To sufficiently utilize information in auxiliary tasks, we present a simple yet effective idea to employ self-supervised loss as an intrinsic reward, called Intrinsically Motivated Self-Supervised learning in Reinforcement learning (IM-SSR). We formally show that the self-supervised loss can be decomposed as exploration for novel states and robustness improvement from nuisance elimination. IM-SSR can be effortlessly plugged into any reinforcement learning with self-supervised auxiliary objectives with nearly no additional cost. Combined with IM-SSR, the previous underlying algorithms achieve salient improvements on both sample efficiency and generalization in various vision-based robotics tasks from the DeepMind Control Suite, especially when the reward signal is sparse.

View on arXiv PDF

Similar