NEAISep 4, 2021

Spiking Neural Networks with Improved Inherent Recurrence Dynamics for Sequential Learning

arXiv:2109.01905v161 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of making SNNs effective for sequential tasks like speech recognition, offering energy-efficient alternatives for edge devices, though it is incremental in improving existing SNN methods.

The paper tackles the problem of training spiking neural networks (SNNs) for sequential learning tasks by modifying LIF neurons to improve inherent recurrence dynamics and developing a training scheme with multi-bit outputs. The result is that the proposed SNN achieves accuracy comparable to LSTMs (within 1.10% on TIMIT and 0.36% on LibriSpeech 100h) with 2x fewer parameters and 10-11x savings in multiplication operations compared to GRUs.

Spiking neural networks (SNNs) with leaky integrate and fire (LIF) neurons, can be operated in an event-driven manner and have internal states to retain information over time, providing opportunities for energy-efficient neuromorphic computing, especially on edge devices. Note, however, many representative works on SNNs do not fully demonstrate the usefulness of their inherent recurrence (membrane potentials retaining information about the past) for sequential learning. Most of the works train SNNs to recognize static images by artificially expanded input representation in time through rate coding. We show that SNNs can be trained for sequential tasks and propose modifications to a network of LIF neurons that enable internal states to learn long sequences and make their inherent recurrence resilient to the vanishing gradient problem. We then develop a training scheme to train the proposed SNNs with improved inherent recurrence dynamics. Our training scheme allows spiking neurons to produce multi-bit outputs (as opposed to binary spikes) which help mitigate the mismatch between a derivative of spiking neurons' activation function and a surrogate derivative used to overcome spiking neurons' non-differentiability. Our experimental results indicate that the proposed SNN architecture on TIMIT and LibriSpeech 100h dataset yields accuracy comparable to that of LSTMs (within 1.10% and 0.36%, respectively), but with 2x fewer parameters than LSTMs. The sparse SNN outputs also lead to 10.13x and 11.14x savings in multiplication operations compared to GRUs, which is generally con-sidered as a lightweight alternative to LSTMs, on TIMIT and LibriSpeech 100h datasets, respectively.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes