NEApr 23, 2016

Memory and Information Processing in Recurrent Neural Networks

Alireza Goudarzi, Sarah Marzen, Peter Banda, Guy Feldman, Christof Teuscher, Darko Stefanovic

arXiv:1604.06929v18.716 citations

Originality Highly original

AI Analysis

This work provides foundational insights into RNN memory and learning, crucial for developing alternative computing architectures based on short-term memory principles.

The authors tackled the problem of analytically determining the memory capacity and task performance of recurrent neural networks (RNN) with arbitrary structures and correlated inputs, presenting an exact solution that relates network structure to function and computing error bounds based on spectral properties.

Recurrent neural networks (RNN) are simple dynamical systems whose computational power has been attributed to their short-term memory. Short-term memory of RNNs has been previously studied analytically only for the case of orthogonal networks, and only under annealed approximation, and uncorrelated input. Here for the first time, we present an exact solution to the memory capacity and the task-solving performance as a function of the structure of a given network instance, enabling direct determination of the function--structure relation in RNNs. We calculate the memory capacity for arbitrary networks with exponentially correlated input and further related it to the performance of the system on signal processing tasks in a supervised learning setup. We compute the expected error and the worst-case error bound as a function of the spectra of the network and the correlation structure of its inputs and outputs. Our results give an explanation for learning and generalization of task solving using short-term memory, which is crucial for building alternative computer architectures using physical phenomena based on the short-term memory principle.

View on arXiv PDF

Similar