LGAINEROMLAug 21, 2020

Curriculum Learning with Hindsight Experience Replay for Sequential Object Manipulation Tasks

arXiv:2008.09377v133 citations
AI Analysis

This addresses the problem of training AI agents on complex, multi-goal manipulation tasks with sparse feedback, which is incremental as it builds on existing HER methods.

The paper tackles learning sequential object manipulation tasks with sparse rewards by combining curriculum learning with Hindsight Experience Replay (HER), showing vast improvements over vanilla-HER on three challenging throwing tasks.

Learning complex tasks from scratch is challenging and often impossible for humans as well as for artificial agents. A curriculum can be used instead, which decomposes a complex task (target task) into a sequence of source tasks (the curriculum). Each source task is a simplified version of the next source task with increasing complexity. Learning then occurs gradually by training on each source task while using knowledge from the curriculum's prior source tasks. In this study, we present a new algorithm that combines curriculum learning with Hindsight Experience Replay (HER), to learn sequential object manipulation tasks for multiple goals and sparse feedback. The algorithm exploits the recurrent structure inherent in many object manipulation tasks and implements the entire learning process in the original simulation without adjusting it to each source task. We have tested our algorithm on three challenging throwing tasks and show vast improvements compared to vanilla-HER.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes