NEOct 20, 2014

Neural Turing Machines

arXiv:1410.5401v22548 citations
Originality Highly original
AI Analysis

This work addresses the limitation of neural networks in handling algorithmic tasks, offering a differentiable architecture that could enhance machine learning for sequential and memory-intensive problems.

The authors tackled the problem of extending neural networks with external memory to enable algorithmic learning, achieving preliminary success in inferring simple algorithms like copying, sorting, and associative recall from examples.

We extend the capabilities of neural networks by coupling them to external memory resources, which they can interact with by attentional processes. The combined system is analogous to a Turing Machine or Von Neumann architecture but is differentiable end-to-end, allowing it to be efficiently trained with gradient descent. Preliminary results demonstrate that Neural Turing Machines can infer simple algorithms such as copying, sorting, and associative recall from input and output examples.

Code Implementations34 repos

Data from Papers with Code (CC-BY-SA-4.0)

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes