NE CV LG NCOct 6, 2021

Spike-inspired Rank Coding for Fast and Accurate Recurrent Neural Networks

Alan Jeffares, Qinghai Guo, Pontus Stenetorp, Timoleon Moraitis

arXiv:2110.02865v39.017 citationsHas Code

Originality Incremental advance

AI Analysis

This work addresses the need for faster and more efficient recurrent neural networks in applications like real-time sequence processing, though it is incremental as it adapts existing methods to new coding schemes.

The paper tackles the problem of slow inference in recurrent neural networks by introducing rank coding, a temporal coding method inspired by spiking neural networks, which allows early stopping during inference and training, achieving 99.19% accuracy on a temporally-encoded MNIST dataset after the first time-step and outperforming state-of-the-art methods in spoken-word classification.

Biological spiking neural networks (SNNs) can temporally encode information in their outputs, e.g. in the rank order in which neurons fire, whereas artificial neural networks (ANNs) conventionally do not. As a result, models of SNNs for neuromorphic computing are regarded as potentially more rapid and efficient than ANNs when dealing with temporal input. On the other hand, ANNs are simpler to train, and usually achieve superior performance. Here we show that temporal coding such as rank coding (RC) inspired by SNNs can also be applied to conventional ANNs such as LSTMs, and leads to computational savings and speedups. In our RC for ANNs, we apply backpropagation through time using the standard real-valued activations, but only from a strategically early time step of each sequential input example, decided by a threshold-crossing event. Learning then incorporates naturally also *when* to produce an output, without other changes to the model or the algorithm. Both the forward and the backward training pass can be significantly shortened by skipping the remaining input sequence after that first event. RC-training also significantly reduces time-to-insight during inference, with a minimal decrease in accuracy. The desired speed-accuracy trade-off is tunable by varying the threshold or a regularization parameter that rewards output entropy. We demonstrate these in two toy problems of sequence classification, and in a temporally-encoded MNIST dataset where our RC model achieves 99.19% accuracy after the first input time-step, outperforming the state of the art in temporal coding with SNNs, as well as in spoken-word classification of Google Speech Commands, outperforming non-RC-trained early inference with LSTMs.

View on arXiv PDF Code

Similar