NESDJun 10, 2017

Bio-Inspired Multi-Layer Spiking Neural Network Extracts Discriminative Features from Speech Signals

arXiv:1706.03170v157 citations
Originality Incremental advance
AI Analysis

This work addresses speech recognition by enabling power-efficient feature extraction, though it is incremental as it builds on existing SNN methods.

The paper tackled extracting discriminative features from speech signals using a bio-inspired spiking neural network with unsupervised learning, achieving over 96% accuracy in spoken digit recognition.

Spiking neural networks (SNNs) enable power-efficient implementations due to their sparse, spike-based coding scheme. This paper develops a bio-inspired SNN that uses unsupervised learning to extract discriminative features from speech signals, which can subsequently be used in a classifier. The architecture consists of a spiking convolutional/pooling layer followed by a fully connected spiking layer for feature discovery. The convolutional layer of leaky, integrate-and-fire (LIF) neurons represents primary acoustic features. The fully connected layer is equipped with a probabilistic spike-timing-dependent plasticity learning rule. This layer represents the discriminative features through probabilistic, LIF neurons. To assess the discriminative power of the learned features, they are used in a hidden Markov model (HMM) for spoken digit recognition. The experimental results show performance above 96% that compares favorably with popular statistical feature extraction methods. Our results provide a novel demonstration of unsupervised feature acquisition in an SNN.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes