ETARLGNEApr 15, 2022

Experimentally realized memristive memory augmented neural network

arXiv:2204.07429v11 citationsh-index: 55
Originality Incremental advance
AI Analysis

This work addresses the problem of practical on-device lifelong learning for AI systems, though it appears incremental as it builds on existing memory augmented neural network concepts with hardware improvements.

The authors tackled the challenge of implementing memory augmented neural networks for lifelong on-device learning by building a fully integrated memristive crossbar platform, achieving an accuracy close to standard software on the Omniglot dataset and demonstrating scalability for one-shot learning on Mini-ImageNet.

Lifelong on-device learning is a key challenge for machine intelligence, and this requires learning from few, often single, samples. Memory augmented neural network has been proposed to achieve the goal, but the memory module has to be stored in an off-chip memory due to its size. Therefore the practical use has been heavily limited. Previous works on emerging memory-based implementation have difficulties in scaling up because different modules with various structures are difficult to integrate on the same chip and the small sense margin of the content addressable memory for the memory module heavily limited the degree of mismatch calculation. In this work, we implement the entire memory augmented neural network architecture in a fully integrated memristive crossbar platform and achieve an accuracy that closely matches standard software on digital hardware for the Omniglot dataset. The successful demonstration is supported by implementing new functions in crossbars in addition to widely reported matrix multiplications. For example, the locality-sensitive hashing operation is implemented in crossbar arrays by exploiting the intrinsic stochasticity of memristor devices. Besides, the content-addressable memory module is realized in crossbars, which also supports the degree of mismatches. Simulations based on experimentally validated models show such an implementation can be efficiently scaled up for one-shot learning on the Mini-ImageNet dataset. The successful demonstration paves the way for practical on-device lifelong learning and opens possibilities for novel attention-based algorithms not possible in conventional hardware.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes