NEFeb 24, 2018

Back to Basics: Benchmarking Canonical Evolution Strategies for Playing Atari

arXiv:1802.08842v1108 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of improving AI performance in complex environments like Atari games for researchers and practitioners, though it is incremental by building on existing ES methods.

The paper tackled the problem of using Evolution Strategies (ES) for playing Atari games, showing that a basic canonical ES algorithm can match or exceed the performance of specialized natural evolution strategies and reinforcement learning algorithms, with concrete performance gains demonstrated on benchmarks.

Evolution Strategies (ES) have recently been demonstrated to be a viable alternative to reinforcement learning (RL) algorithms on a set of challenging deep RL problems, including Atari games and MuJoCo humanoid locomotion benchmarks. While the ES algorithms in that work belonged to the specialized class of natural evolution strategies (which resemble approximate gradient RL algorithms, such as REINFORCE), we demonstrate that even a very basic canonical ES algorithm can achieve the same or even better performance. This success of a basic ES algorithm suggests that the state-of-the-art can be advanced further by integrating the many advances made in the field of ES in the last decades. We also demonstrate qualitatively that ES algorithms have very different performance characteristics than traditional RL algorithms: on some games, they learn to exploit the environment and perform much better while on others they can get stuck in suboptimal local minima. Combining their strengths with those of traditional RL algorithms is therefore likely to lead to new advances in the state of the art.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes