NEJun 4, 2018

Challenges in High-dimensional Reinforcement Learning with Evolution Strategies

arXiv:1806.01224v230 citations
Originality Synthesis-oriented
AI Analysis

This work addresses scalability issues in reinforcement learning for control tasks, but it is incremental as it builds on existing ES methods.

The paper tackled the challenge of applying Evolution Strategies (ESs) to high-dimensional, stochastic reinforcement learning problems, revealing insights into effective algorithmic mechanisms and principled limitations, while showing that combining low-cost ESs with uncertainty handling yields promising methods.

Evolution Strategies (ESs) have recently become popular for training deep neural networks, in particular on reinforcement learning tasks, a special form of controller design. Compared to classic problems in continuous direct search, deep networks pose extremely high-dimensional optimization problems, with many thousands or even millions of variables. In addition, many control problems give rise to a stochastic fitness function. Considering the relevance of the application, we study the suitability of evolution strategies for high-dimensional, stochastic problems. Our results give insights into which algorithmic mechanisms of modern ES are of value for the class of problems at hand, and they reveal principled limitations of the approach. They are in line with our theoretical understanding of ESs. We show that combining ESs that offer reduced internal algorithm cost with uncertainty handling techniques yields promising methods for this class of problems.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes