ML LGMay 23, 2016

Learning and Policy Search in Stochastic Dynamical Systems with Bayesian Neural Networks

Stefan Depeweg, José Miguel Hernández-Lobato, Finale Doshi-Velez, Steffen Udluft

arXiv:1605.07127v328.1167 citationsHas Code

Originality Incremental advance

AI Analysis

This work addresses the challenge of capturing complex statistical patterns like multi-modality and heteroskedasticity in transition dynamics for reinforcement learning, which is incremental as it builds on existing Bayesian neural network methods.

The authors tackled the problem of model-based reinforcement learning in stochastic dynamical systems by developing an algorithm that combines Bayesian neural networks with random roll-outs and stochastic optimization, resulting in improved performance on a challenging benchmark and promising results in controlling a gas turbine.

We present an algorithm for model-based reinforcement learning that combines Bayesian neural networks (BNNs) with random roll-outs and stochastic optimization for policy learning. The BNNs are trained by minimizing $α$-divergences, allowing us to capture complicated statistical patterns in the transition dynamics, e.g. multi-modality and heteroskedasticity, which are usually missed by other common modeling approaches. We illustrate the performance of our method by solving a challenging benchmark where model-based approaches usually fail and by obtaining promising results in a real-world scenario for controlling a gas turbine.

View on arXiv PDF Code

Similar