LG AI CV GTAug 11, 2021

An Approach to Partial Observability in Games: Learning to Both Act and Observe

Elizabeth Gilmour, Noah Plotkin, Leslie Smith

arXiv:2108.05701v11 citations

Originality Incremental advance

AI Analysis

This addresses the challenge of partial observability in games like Starcraft II and real-world settings for RL researchers, though it is incremental as it builds on existing RL methods.

The paper tackled the problem of reinforcement learning in partially observable environments by developing a method to mask parts of Atari games, enabling agents to learn both where to look and how to play, and verified that a simple model can learn to look effectively with limited visual bandwidth.

Reinforcement learning (RL) is successful at learning to play games where the entire environment is visible. However, RL approaches are challenged in complex games like Starcraft II and in real-world environments where the entire environment is not visible. In these more complex games with more limited visual information, agents must choose where to look and how to optimally use their limited visual information in order to succeed at the game. We verify that with a relatively simple model the agent can learn where to look in scenarios with a limited visual bandwidth. We develop a method for masking part of the environment in Atari games to force the RL agent to learn both where to look and how to play the game in order to study where the RL agent learns to look. In addition, we develop a neural network architecture and method for allowing the agent to choose where to look and what action to take in the Pong game. Further, we analyze the strategies the agent learns to better understand how the RL agent learns to play the game.

View on arXiv PDF

Similar