LG CV MLJun 27, 2019

Supervise Thyself: Examining Self-Supervised Representations in Interactive Environments

arXiv:1906.11951v14.13 citationsHas Code

Originality Synthesis-oriented

AI Analysis

This is an incremental study for researchers in reinforcement learning, addressing how self-supervised representations perform in interactive settings.

The study evaluated self-supervised learning methods on visual environments like Flappy Bird and Sonic The Hedgehog, finding that representation utility depends heavily on environment visuals and dynamics.

Self-supervised methods, wherein an agent learns representations solely by observing the results of its actions, become crucial in environments which do not provide a dense reward signal or have labels. In most cases, such methods are used for pretraining or auxiliary tasks for "downstream" tasks, such as control, exploration, or imitation learning. However, it is not clear which method's representations best capture meaningful features of the environment, and which are best suited for which types of environments. We present a small-scale study of self-supervised methods on two visual environments: Flappy Bird and Sonic The Hedgehog. In particular, we quantitatively evaluate the representations learned from these tasks in two contexts: a) the extent to which the representations capture true state information of the agent and b) how generalizable these representations are to novel situations, like new levels and textures. Lastly, we evaluate these self-supervised features by visualizing which parts of the environment they focus on. Our results show that the utility of the representations is highly dependent on the visuals and dynamics of the environment.

View on arXiv PDF Code

Similar