ITMLMay 26, 2016

Distributed Sequence Memory of Multidimensional Inputs in Recurrent Networks

arXiv:1605.08346v321 citations
Originality Incremental advance
AI Analysis

This provides a theoretical advance for machine learning researchers studying memory properties in RNNs, but it is incremental as it builds on existing analyses.

The paper tackled the problem of analyzing short-term memory capacity in recurrent neural networks, specifically echo state networks, for multidimensional inputs with low-dimensional structure, showing that network size scales linearly with information rate and poly-logarithmically with ambient dimension.

Recurrent neural networks (RNNs) have drawn interest from machine learning researchers because of their effectiveness at preserving past inputs for time-varying data processing tasks. To understand the success and limitations of RNNs, it is critical that we advance our analysis of their fundamental memory properties. We focus on echo state networks (ESNs), which are RNNs with simple memoryless nodes and random connectivity. In most existing analyses, the short-term memory (STM) capacity results conclude that the ESN network size must scale linearly with the input size for unstructured inputs. The main contribution of this paper is to provide general results characterizing the STM capacity for linear ESNs with multidimensional input streams when the inputs have common low-dimensional structure: sparsity in a basis or significant statistical dependence between inputs. In both cases, we show that the number of nodes in the network must scale linearly with the information rate and poly-logarithmically with the ambient input dimension. The analysis relies on advanced applications of random matrix theory and results in explicit non-asymptotic bounds on the recovery error. Taken together, this analysis provides a significant step forward in our understanding of the STM properties in RNNs.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes