LGAIMLNov 14, 2018

Natural Environment Benchmarks for Reinforcement Learning

arXiv:1811.06032v170 citations
Originality Synthesis-oriented
AI Analysis

This work addresses the need for better benchmarks in reinforcement learning to improve generalization, though it is incremental as it builds on existing evaluation frameworks.

The authors tackled the problem of brittle reinforcement learning policies by proposing three new benchmark domains that incorporate natural-world complexity while enabling fast data acquisition and fair evaluation, aiming to encourage the development of more robust algorithms.

While current benchmark reinforcement learning (RL) tasks have been useful to drive progress in the field, they are in many ways poor substitutes for learning with real-world data. By testing increasingly complex RL algorithms on low-complexity simulation environments, we often end up with brittle RL policies that generalize poorly beyond the very specific domain. To combat this, we propose three new families of benchmark RL domains that contain some of the complexity of the natural world, while still supporting fast and extensive data acquisition. The proposed domains also permit a characterization of generalization through fair train/test separation, and easy comparison and replication of results. Through this work, we challenge the RL research community to develop more robust algorithms that meet high standards of evaluation.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes