AI, LG, NE | Jun 9, 2022

Deep Surrogate Assisted Generation of Environments

arXiv:2206.04199v3 | 50 citations | h-index: 32
Originality: Incremental advance
AI Analysis

This work addresses the challenge of generating environments for testing reinforcement learning (RL) agents. It offers a more sample-efficient approach that could accelerate agent evaluation and development, though it appears incremental because it builds on existing quality diversity (QD) optimization techniques.

The paper tackles the problem of efficiently generating diverse environments for testing reinforcement learning agents. It proposes DSAGE, a sample-efficient quality diversity algorithm that uses a deep surrogate model to predict agent behaviors in new environments. In two benchmark domains, DSAGE significantly outperforms existing QD environment generation algorithms at discovering collections of environments that elicit diverse behaviors from a state-of-the-art RL agent and a planning agent.

Recent progress in reinforcement learning (RL) has started producing generally capable agents that can solve a distribution of complex environments. These agents are typically tested on fixed, human-authored environments. On the other hand, quality diversity (QD) optimization has been proven to be an effective component of environment generation algorithms, which can generate collections of high-quality environments that are diverse in the resulting agent behaviors. However, these algorithms require potentially expensive simulations of agents on newly generated environments. We propose Deep Surrogate Assisted Generation of Environments (DSAGE), a sample-efficient QD environment generation algorithm that maintains a deep surrogate model for predicting agent behaviors in new environments. Results in two benchmark domains show that DSAGE significantly outperforms existing QD environment generation algorithms in discovering collections of environments that elicit diverse behaviors of a state-of-the-art RL agent and a planning agent. Our source code and videos are available at https://dsagepaper.github.io/.
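
To make the idea in the abstract concrete, here is a minimal sketch of a surrogate-assisted QD loop. It is not the authors' implementation. Everything in it is an assumption chosen for illustration: the toy `simulate` function stands in for an expensive agent rollout, environments are plain vectors in [0, 1]^8, the archive is a simple MAP-Elites grid, and a ridge regression (`LinearSurrogate`) replaces the deep surrogate model. The structure it tries to show is the one the abstract describes: search cheaply against model predictions, spend real simulations only on the most promising candidates, then retrain the model on the new data.

```python
import numpy as np

rng = np.random.default_rng(0)
DIM, CELLS = 8, 10  # toy environment size and archive resolution per measure

def simulate(env):
    """Hypothetical stand-in for an expensive agent rollout."""
    objective = 1.0 - float(np.mean((env - 0.5) ** 2))
    measures = np.array([env[: DIM // 2].mean(), env[DIM // 2 :].mean()])
    return objective, measures

def cell_of(measures):
    """Map measures (nominally in [0, 1]^2) to a discrete archive cell."""
    idx = np.clip((measures * CELLS).astype(int), 0, CELLS - 1)
    return tuple(idx)

def insert(archive, env, objective, measures):
    """MAP-Elites insertion: keep only the best environment per cell."""
    key = cell_of(measures)
    if key not in archive or objective > archive[key][1]:
        archive[key] = (env, objective)

class LinearSurrogate:
    """Ridge regression used here in place of a deep surrogate model."""
    def fit(self, X, Y, reg=1e-3):
        A = np.hstack([X, np.ones((len(X), 1))])  # add a bias column
        self.W = np.linalg.solve(A.T @ A + reg * np.eye(A.shape[1]), A.T @ Y)

    def predict(self, env):
        out = np.append(env, 1.0) @ self.W
        return float(out[0]), out[1:]  # predicted objective, predicted measures

def surrogate_assisted_qd(outer_iters=5, inner_iters=200, init=20):
    truth, X, Y, n_sims = {}, [], [], 0  # ground-truth archive and dataset

    def evaluate(env):
        nonlocal n_sims
        obj, meas = simulate(env)  # the only place real simulation happens
        n_sims += 1
        X.append(env)
        Y.append(np.concatenate([[obj], meas]))
        insert(truth, env, obj, meas)

    for _ in range(init):  # bootstrap the dataset with random environments
        evaluate(rng.random(DIM))

    model = LinearSurrogate()
    for _ in range(outer_iters):
        model.fit(np.array(X), np.array(Y))
        predicted = {}  # archive filled using model predictions only
        seed = rng.random(DIM)
        insert(predicted, seed, *model.predict(seed))
        for _ in range(inner_iters):
            keys = list(predicted)
            parent = predicted[keys[rng.integers(len(keys))]][0]
            child = np.clip(parent + rng.normal(0, 0.1, DIM), 0, 1)
            insert(predicted, child, *model.predict(child))
        for env, _ in list(predicted.values()):  # verify predicted elites
            evaluate(env)
    return truth, n_sims

archive, n_sims = surrogate_assisted_qd()
print(len(archive), "cells filled using", n_sims, "simulations")
```

In this sketch the inner loop performs `inner_iters` cheap model queries per outer iteration, while the number of real simulations per iteration is bounded by the number of archive cells. That asymmetry is where the sample efficiency would come from when `simulate` is genuinely expensive.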