CVDec 22, 2021

Can Deep Neural Networks be Converted to Ultra Low-Latency Spiking Neural Networks?

arXiv:2112.12133v18.741 citations

Originality Highly original

AI Analysis

This work addresses energy efficiency for resource-constrained devices by enabling faster and more efficient SNNs, representing a strong specific gain rather than an incremental improvement.

The paper tackled the problem of high latency and energy consumption in spiking neural networks (SNNs) by developing a new training algorithm that accurately captures pre-activation distributions, enabling ultra low-latency SNNs with high sparsity. The result was a 64.19% top-1 accuracy on CIFAR-100 with only 2 time steps and ~159.2x lower compute energy compared to standard DNNs, while performing inference 2.5-8x faster than other SOTA SNNs.

Spiking neural networks (SNNs), that operate via binary spikes distributed over time, have emerged as a promising energy efficient ML paradigm for resource-constrained devices. However, the current state-of-the-art (SOTA) SNNs require multiple time steps for acceptable inference accuracy, increasing spiking activity and, consequently, energy consumption. SOTA training strategies for SNNs involve conversion from a non-spiking deep neural network (DNN). In this paper, we determine that SOTA conversion strategies cannot yield ultra low latency because they incorrectly assume that the DNN and SNN pre-activation values are uniformly distributed. We propose a new training algorithm that accurately captures these distributions, minimizing the error between the DNN and converted SNN. The resulting SNNs have ultra low latency and high activation sparsity, yielding significant improvements in compute efficiency. In particular, we evaluate our framework on image recognition tasks from CIFAR-10 and CIFAR-100 datasets on several VGG and ResNet architectures. We obtain top-1 accuracy of 64.19% with only 2 time steps on the CIFAR-100 dataset with ~159.2x lower compute energy compared to an iso-architecture standard DNN. Compared to other SOTA SNN models, our models perform inference 2.5-8x faster (i.e., with fewer time steps).

View on arXiv PDF

Similar