LGAIROFeb 15, 2023

Meta-Reinforcement Learning via Exploratory Task Clustering

arXiv:2302.07958v112 citationsh-index: 15
Originality Incremental advance
AI Analysis

This work addresses the challenge of handling diverse task distributions in meta-RL, which is incremental as it builds on prior methods by incorporating clustering to enhance efficiency.

The paper tackled the problem of structured heterogeneity in meta-reinforcement learning tasks by using exploratory clustering to improve knowledge sharing, resulting in more sample-efficient policy adaptation and strong advantages over state-of-the-art baselines in MuJoCo experiments.

Meta-reinforcement learning (meta-RL) aims to quickly solve new tasks by leveraging knowledge from prior tasks. However, previous studies often assume a single mode homogeneous task distribution, ignoring possible structured heterogeneity among tasks. Leveraging such structures can better facilitate knowledge sharing among related tasks and thus improve sample efficiency. In this paper, we explore the structured heterogeneity among tasks via clustering to improve meta-RL. We develop a dedicated exploratory policy to discover task structures via divide-and-conquer. The knowledge of the identified clusters helps to narrow the search space of task-specific information, leading to more sample efficient policy adaptation. Experiments on various MuJoCo tasks showed the proposed method can unravel cluster structures effectively in both rewards and state dynamics, proving strong advantages against a set of state-of-the-art baselines.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes