MLMay 25, 2017

Predictive State Recurrent Neural Networks

Carlton Downey, Ahmed Hefny, Boyue Li, Byron Boots, Geoffrey Gordon

arXiv:1705.09353v213.458 citations

Originality Incremental advance

AI Analysis

This work addresses modeling challenges in dynamical systems for researchers and practitioners, but it appears incremental as it builds on existing RNN and PSR methods.

The authors tackled the problem of filtering and prediction in dynamical systems by introducing Predictive State Recurrent Neural Networks (PSRNNs), which combine insights from RNNs and Predictive State Representations, and demonstrated that PSRNNs outperform several alternative approaches on four datasets.

We present a new model, Predictive State Recurrent Neural Networks (PSRNNs), for filtering and prediction in dynamical systems. PSRNNs draw on insights from both Recurrent Neural Networks (RNNs) and Predictive State Representations (PSRs), and inherit advantages from both types of models. Like many successful RNN architectures, PSRNNs use (potentially deeply composed) bilinear transfer functions to combine information from multiple sources. We show that such bilinear functions arise naturally from state updates in Bayes filters like PSRs, in which observations can be viewed as gating belief states. We also show that PSRNNs can be learned effectively by combining Backpropogation Through Time (BPTT) with an initialization derived from a statistically consistent learning algorithm for PSRs called two-stage regression (2SR). Finally, we show that PSRNNs can be factorized using tensor decomposition, reducing model size and suggesting interesting connections to existing multiplicative architectures such as LSTMs. We applied PSRNNs to 4 datasets, and showed that we outperform several popular alternative approaches to modeling dynamical systems in all cases.

View on arXiv PDF

Similar