ETDCLGMar 27, 2020

IMAC: In-memory multi-bit Multiplication andACcumulation in 6T SRAM Array

arXiv:2003.12558v1109 citations
AI Analysis

This addresses the memory bottleneck for on-chip computing in AI hardware, offering a significant improvement in efficiency and speed for neural network inference.

The paper tackles the memory bottleneck in computing by proposing a novel in-memory multiplication and accumulation operation in 6T SRAM arrays, achieving 88.8% and 99% accuracy on CIFAR-10 and MNIST datasets with 6.24x better energy consumption and 9.42x better delay compared to standard systems.

`In-memory computing' is being widely explored as a novel computing paradigm to mitigate the well known memory bottleneck. This emerging paradigm aims at embedding some aspects of computations inside the memory array, thereby avoiding frequent and expensive movement of data between the compute unit and the storage memory. In-memory computing with respect to Silicon memories has been widely explored on various memory bit-cells. Embedding computation inside the 6 transistor (6T) SRAM array is of special interest since it is the most widely used on-chip memory. In this paper, we present a novel in-memory multiplication followed by accumulation operation capable of performing parallel dot products within 6T SRAM without any changes to the standard bitcell. We, further, study the effect of circuit non-idealities and process variations on the accuracy of the LeNet-5 and VGG neural network architectures against the MNIST and CIFAR-10 datasets, respectively. The proposed in-memory dot-product mechanism achieves 88.8% and 99% accuracy for the CIFAR-10 and MNIST, respectively. Compared to the standard von Neumann system, the proposed system is 6.24x better in energy consumption and 9.42x better in delay.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes