LGMLJun 20, 2018

Reinforcement Learning using Augmented Neural Networks

arXiv:1806.07692v1
Originality Synthesis-oriented
AI Analysis

This addresses a stability issue in reinforcement learning for practitioners using neural networks, but it appears incremental as it builds on known problems with existing methods.

The paper tackled instability in deep Q-networks (DQN) caused by globalized updates in neural networks, showing that simple structural modifications to multi-layer perceptrons can improve DQN learning stability.

Neural networks allow Q-learning reinforcement learning agents such as deep Q-networks (DQN) to approximate complex mappings from state spaces to value functions. However, this also brings drawbacks when compared to other function approximators such as tile coding or their generalisations, radial basis functions (RBF) because they introduce instability due to the side effect of globalised updates present in neural networks. This instability does not even vanish in neural networks that do not have any hidden layers. In this paper, we show that simple modifications to the structure of the neural network can improve stability of DQN learning when a multi-layer perceptron is used for function approximation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes