LGAIHCNEMLSep 6, 2019

DRLViz: Understanding Decisions and Memory in Deep Reinforcement Learning

arXiv:1909.02982v241 citations
Originality Synthesis-oriented
AI Analysis

This addresses the interpretability problem for experts working with deep reinforcement learning agents in navigation scenarios, though it is incremental as it builds on existing visual analytics methods.

The researchers tackled the problem of interpreting the internal memory of deep reinforcement learning agents, which is typically a black box, by developing DRLViz, a visual analytics interface that helps experts understand decisions and investigate errors in navigation tasks. They applied it to video game simulators like ViZDoom and reported on expert evaluations and its broader applicability to navigation problems.

We present DRLViz, a visual analytics interface to interpret the internal memory of an agent (e.g. a robot) trained using deep reinforcement learning. This memory is composed of large temporal vectors updated when the agent moves in an environment and is not trivial to understand due to the number of dimensions, dependencies to past vectors, spatial/temporal correlations, and co-correlation between dimensions. It is often referred to as a black box as only inputs (images) and outputs (actions) are intelligible for humans. Using DRLViz, experts are assisted to interpret decisions using memory reduction interactions, and to investigate the role of parts of the memory when errors have been made (e.g. wrong direction). We report on DRLViz applied in the context of video games simulators (ViZDoom) for a navigation scenario with item gathering tasks. We also report on experts evaluation using DRLViz, and applicability of DRLViz to other scenarios and navigation problems beyond simulation games, as well as its contribution to black box models interpretability and explainability in the field of visual analytics.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes