AINov 11, 2020

Reinforcement Learning with Dual-Observation for General Video Game Playing

arXiv:2011.05622v416 citations
AI Analysis

This addresses the challenge of general video game playing for AI researchers, but it is incremental as it builds on existing reinforcement learning methods with a novel input approach.

The paper tackles the problem of improving reinforcement learning agents' generalization to unseen game levels in the General Video Game AI Learning Competition by proposing a dual-observation technique that uses encoded global and local inputs, achieving outstanding performance in ablation studies on the 2020 competition game set.

Reinforcement learning algorithms have performed well in playing challenging board and video games. More and more studies focus on improving the generalisation ability of reinforcement learning algorithms. The General Video Game AI Learning Competition aims to develop agents capable of learning to play different game levels that were unseen during training. This paper summarises the five years' General Video Game AI Learning Competition editions. At each edition, three new games were designed. The training and test levels were designed separately in the first three editions. Since 2020, three test levels of each game were generated by perturbing or combining two training levels. Then, we present a novel reinforcement learning technique with dual-observation for general video game playing, assuming that it is more likely to observe similar local information in different levels rather than global information. Instead of directly inputting a single, raw pixel-based screenshot of the current game screen, our proposed general technique takes the encoded, transformed global and local observations of the game screen as two simultaneous inputs, aiming at learning local information for playing new levels. Our proposed technique is implemented with three state-of-the-art reinforcement learning algorithms and tested on the game set of the 2020 General Video Game AI Learning Competition. Ablation studies show the outstanding performance of using encoded, transformed global and local observations as input.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes