LGMLNov 8, 2019

Fully Bayesian Recurrent Neural Networks for Safe Reinforcement Learning

arXiv:1911.03308v26 citations
Originality Incremental advance
AI Analysis

This work addresses safety concerns in autonomous systems for applications like collision avoidance, though it appears incremental as it builds on existing probabilistic methods.

The paper tackled the problem of poor explainability and safety in reinforcement learning by introducing a fully Bayesian Recurrent Neural Network architecture, demonstrating that it significantly outperforms a popular method in collision avoidance tasks with less training and greater efficiency.

Reinforcement Learning (RL) has demonstrated state-of-the-art results in a number of autonomous system applications, however many of the underlying algorithms rely on black-box predictions. This results in poor explainability of the behaviour of these systems, raising concerns as to their use in safety-critical applications. Recent work has demonstrated that uncertainty-aware models exhibit more cautious behaviours through the incorporation of model uncertainty estimates. In this work, we build on Probabilistic Backpropagation to introduce a fully Bayesian Recurrent Neural Network architecture. We apply this within a Safe RL scenario, and demonstrate that the proposed method significantly outperforms a popular approach for obtaining model uncertainties in collision avoidance tasks. Furthermore, we demonstrate that the proposed approach requires less training and is far more efficient than the current leading method, both in terms of compute resource and memory footprint.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes