ROAILGNov 4, 2024

Eurekaverse: Environment Curriculum Generation via Large Language Models

arXiv:2411.01775v15 citationsh-index: 37CoRL
Originality Incremental advance
AI Analysis

This addresses the need for expert-driven, domain-specific curriculum design in robotics, offering an automated solution that is incremental but impactful for skill acquisition.

The paper tackles the problem of automating environment curriculum design for robot skill training by using large language models to generate progressively challenging environments, achieving successful real-world transfer in quadrupedal parkour learning and outperforming human-designed courses.

Recent work has demonstrated that a promising strategy for teaching robots a wide range of complex skills is by training them on a curriculum of progressively more challenging environments. However, developing an effective curriculum of environment distributions currently requires significant expertise, which must be repeated for every new domain. Our key insight is that environments are often naturally represented as code. Thus, we probe whether effective environment curriculum design can be achieved and automated via code generation by large language models (LLM). In this paper, we introduce Eurekaverse, an unsupervised environment design algorithm that uses LLMs to sample progressively more challenging, diverse, and learnable environments for skill training. We validate Eurekaverse's effectiveness in the domain of quadrupedal parkour learning, in which a quadruped robot must traverse through a variety of obstacle courses. The automatic curriculum designed by Eurekaverse enables gradual learning of complex parkour skills in simulation and can successfully transfer to the real-world, outperforming manual training courses designed by humans.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes