LGFLMLJun 5, 2020

A provably stable neural network Turing Machine

arXiv:2006.03651v47 citations
Originality Highly original
AI Analysis

This work provides a new theoretical bound for the computational capability of bounded-precision RNNs with memory, which is foundational for understanding neural network expressiveness.

The authors tackled the problem of simulating pushdown automata and Turing machines with bounded-precision recurrent neural networks, proving that their differentiable neural stack and neural Turing machine architectures can simulate any PDA or TM using only seven finite-pounded neurons.

We introduce a neural stack architecture, including a differentiable parametrized stack operator that approximates stack push and pop operations for suitable choices of parameters that explicitly represents a stack. We prove the stability of this stack architecture: after arbitrarily many stack operations, the state of the neural stack still closely resembles the state of the discrete stack. Using the neural stack with a recurrent neural network, we introduce a neural network Pushdown Automaton (nnPDA) and prove that nnPDA with finite/bounded neurons and time can simulate any PDA. Furthermore, we extend our construction and propose new architecture neural state Turing Machine (nnTM). We prove that differentiable nnTM with bounded neurons can simulate Turing Machine (TM) in real-time. Just like the neural stack, these architectures are also stable. Finally, we extend our construction to show that differentiable nnTM is equivalent to Universal Turing Machine (UTM) and can simulate any TM with only \textbf{seven finite/bounded precision} neurons. This work provides a new theoretical bound for the computational capability of bounded precision RNNs augmented with memory.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes