LGETSYJan 31, 2022

Neural Network Training with Asymmetric Crosspoint Elements

arXiv:2201.13377v123 citations
AI Analysis

This addresses a critical bottleneck for hardware acceleration of deep neural networks, enabling practical implementation with current resistive memory technologies.

The paper tackles the problem of asymmetric conductance modulation in analog crossbar arrays degrading neural network training performance by introducing Stochastic Hamiltonian Descent, which minimizes a system energy function to exploit asymmetry, enabling functional analog deep learning accelerators with existing devices.

Analog crossbar arrays comprising programmable nonvolatile resistors are under intense investigation for acceleration of deep neural network training. However, the ubiquitous asymmetric conductance modulation of practical resistive devices critically degrades the classification performance of networks trained with conventional algorithms. Here, we describe and experimentally demonstrate an alternative fully-parallel training algorithm: Stochastic Hamiltonian Descent. Instead of conventionally tuning weights in the direction of the error function gradient, this method programs the network parameters to successfully minimize the total energy (Hamiltonian) of the system that incorporates the effects of device asymmetry. We provide critical intuition on why device asymmetry is fundamentally incompatible with conventional training algorithms and how the new approach exploits it as a useful feature instead. Our technique enables immediate realization of analog deep learning accelerators based on readily available device technologies.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes