NELGApr 4, 2019

Learning Numeracy: Binary Arithmetic with Neural Turing Machines

arXiv:1904.02478v21 citations
AI Analysis

This work addresses the problem of enhancing neural network capabilities for algorithmic reasoning, though it is incremental as it builds on existing NTM frameworks.

The authors tackled the challenge of training Neural Turing Machines (NTMs) on complex algorithmic tasks by applying them to binary addition and multiplication, achieving successful learning and generalization on these more difficult problems compared to prior work.

One of the main problems encountered so far with recurrent neural networks is that they struggle to retain long-time information dependencies in their recurrent connections. Neural Turing Machines (NTMs) attempt to mitigate this issue by providing the neural network with an external portion of memory, in which information can be stored and manipulated later on. The whole mechanism is differentiable end-to-end, allowing the network to learn how to utilise this long-term memory via stochastic gradient descent. This allows NTMs to infer simple algorithms directly from data sequences. Nonetheless, the model can be hard to train due to a large number of parameters and interacting components and little related work is present. In this work we use NTMs to learn and generalise two arithmetical tasks: binary addition and multiplication. These tasks are two fundamental algorithmic examples in computer science, and are a lot more challenging than the previously explored ones, with which we aim to shed some light on the real capabilities on this neural model.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes