LGMLJul 19, 2019

Accelerating Reinforcement Learning through GPU Atari Emulation

arXiv:1907.08467v211 citationsHas Code
Originality Incremental advance
AI Analysis

This work addresses the computational inefficiency in reinforcement learning for researchers and practitioners, though it is incremental as it ports an existing environment to GPU hardware.

The authors tackled the bottleneck of CPU-based emulators in deep reinforcement learning by developing CuLE, a GPU-accelerated Atari emulator that generates up to 155M frames per hour on a single GPU, enabling faster training convergence for algorithms like A2C+V-trace.

We introduce CuLE (CUDA Learning Environment), a CUDA port of the Atari Learning Environment (ALE) which is used for the development of deep reinforcement algorithms. CuLE overcomes many limitations of existing CPU-based emulators and scales naturally to multiple GPUs. It leverages GPU parallelization to run thousands of games simultaneously and it renders frames directly on the GPU, to avoid the bottleneck arising from the limited CPU-GPU communication bandwidth. CuLE generates up to 155M frames per hour on a single GPU, a finding previously achieved only through a cluster of CPUs. Beyond highlighting the differences between CPU and GPU emulators in the context of reinforcement learning, we show how to leverage the high throughput of CuLE by effective batching of the training data, and show accelerated convergence for A2C+V-trace. CuLE is available at https://github.com/NVLabs/cule .

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes