LGAIMLJun 12, 2020

A Brief Look at Generalization in Visual Meta-Reinforcement Learning

arXiv:2006.07262v38 citations
AI Analysis

This study addresses the problem of overfitting and scalability in meta-reinforcement learning for researchers and practitioners, but it is incremental as it extends existing generalization concerns to meta-learning.

The paper investigates the generalization performance of meta-reinforcement learning algorithms in high-dimensional, procedurally generated environments, finding that they exhibit strong overfitting and struggle with scalability in challenging tasks.

Due to the realization that deep reinforcement learning algorithms trained on high-dimensional tasks can strongly overfit to their training environments, there have been several studies that investigated the generalization performance of these algorithms. However, there has been no similar study that evaluated the generalization performance of algorithms that were specifically designed for generalization, i.e. meta-reinforcement learning algorithms. In this paper, we assess the generalization performance of these algorithms by leveraging high-dimensional, procedurally generated environments. We find that these algorithms can display strong overfitting when they are evaluated on challenging tasks. We also observe that scalability to high-dimensional tasks with sparse rewards remains a significant problem among many of the current meta-reinforcement learning algorithms. With these results, we highlight the need for developing meta-reinforcement learning algorithms that can both generalize and scale.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes