LGJan 19, 2023

A Tutorial on Meta-Reinforcement Learning

arXiv:2301.08028v4156 citationsh-index: 93
AI Analysis

It provides a comprehensive overview for researchers and practitioners to understand and apply meta-RL, but is incremental as a survey rather than presenting new methods.

This tutorial addresses the poor data efficiency and limited generality of deep reinforcement learning by exploring meta-reinforcement learning as a solution, surveying algorithms and applications to enable adaptation to new tasks with minimal data.

While deep reinforcement learning (RL) has fueled multiple high-profile successes in machine learning, it is held back from more widespread adoption by its often poor data efficiency and the limited generality of the policies it produces. A promising approach for alleviating these limitations is to cast the development of better RL algorithms as a machine learning problem itself in a process called meta-RL. Meta-RL is most commonly studied in a problem setting where, given a distribution of tasks, the goal is to learn a policy that is capable of adapting to any new task from the task distribution with as little data as possible. In this survey, we describe the meta-RL problem setting in detail as well as its major variations. We discuss how, at a high level, meta-RL research can be clustered based on the presence of a task distribution and the learning budget available for each individual task. Using these clusters, we then survey meta-RL algorithms and applications. We conclude by presenting the open problems on the path to making meta-RL part of the standard toolbox for a deep RL practitioner.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes