NE LG · Mar 17, 2019

A Brain-inspired Algorithm for Training Highly Sparse Neural Networks

Zahra Atashgahi, Joost Pieterse, Shiwei Liu, Decebal Constantin Mocanu, Raymond Veldhuis, Mykola Pechenizkiy

arXiv:1903.07138v3 · 20 citations · Has Code

Originality: Highly original

AI Analysis

This work addresses the computational cost of pruning-based methods on low-resource devices, though it is an incremental improvement in sparse training algorithms.

The paper tackles the problem of training highly sparse neural networks from scratch by proposing a brain-inspired algorithm that evolves network topology based on neuron behavior, achieving superior performance over state-of-the-art methods in extremely sparse scenarios across eight datasets.

Sparse neural networks attract increasing interest as they exhibit comparable performance to their dense counterparts while being computationally efficient. Pruning dense neural networks is among the most widely used methods to obtain a sparse neural network. Because the high training cost of such methods can be unaffordable for low-resource devices, training sparse neural networks from scratch has recently gained attention. However, existing sparse training algorithms suffer from various issues, including poor performance in high-sparsity scenarios, computing dense gradient information during training, or purely random topology search. In this paper, inspired by the evolution of the biological brain and the Hebbian learning theory, we present a new sparse training approach that evolves sparse neural networks according to the behavior of neurons in the network. Concretely, by exploiting the cosine similarity metric to measure the importance of the connections, our proposed method, Cosine similarity-based and Random Topology Exploration (CTRE), evolves the topology of sparse neural networks by adding the most important connections to the network without calculating dense gradients in the backward pass. We carry out experiments on eight datasets, including tabular, image, and text datasets, and demonstrate that our proposed method outperforms several state-of-the-art sparse training algorithms in extremely sparse neural networks by a large margin. The implementation code is available at https://github.com/zahraatashgahi/CTRE
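The cosine-similarity criterion described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation (see the linked repository for that); it only shows one plausible reading of the idea: score each candidate connection by the cosine similarity between the activation vectors of its two endpoint neurons over a batch, then add the highest-scoring connections that are currently absent. The function names `cosine_importance` and `grow_connections`, the use of the absolute value, and the boolean-mask representation are assumptions made for illustration, and the random exploration component of CTRE is omitted.

```python
import numpy as np


def cosine_importance(pre_acts, post_acts, eps=1e-12):
    """Score every (input neuron, output neuron) pair by cosine similarity.

    pre_acts:  array of shape (batch, n_in), activations of the earlier layer.
    post_acts: array of shape (batch, n_out), activations of the later layer.
    Returns an (n_in, n_out) array of absolute cosine similarities.
    Taking the absolute value is an assumption of this sketch.
    """
    pre_norm = np.linalg.norm(pre_acts, axis=0) + eps    # shape (n_in,)
    post_norm = np.linalg.norm(post_acts, axis=0) + eps  # shape (n_out,)
    sim = (pre_acts.T @ post_acts) / np.outer(pre_norm, post_norm)
    return np.abs(sim)


def grow_connections(mask, importance, k):
    """Return a copy of `mask` with the k best missing connections added.

    mask:       boolean array (n_in, n_out), True where a connection exists.
    importance: float array (n_in, n_out), e.g. from cosine_importance.
    Only forward activations are needed, so no dense gradient is computed.
    """
    new_mask = mask.copy()
    k = min(k, int((~mask).sum()))  # cannot add more than what is missing
    if k == 0:
        return new_mask
    # Existing connections are excluded by giving them the lowest score.
    scores = np.where(mask, -np.inf, importance)
    flat_idx = np.argpartition(scores.ravel(), -k)[-k:]
    new_mask[np.unravel_index(flat_idx, mask.shape)] = True
    return new_mask
```

In a sparse training loop, a step like this would typically follow the removal of low-magnitude weights, so that the total number of connections stays fixed; that schedule is likewise an assumption here and is not stated in the abstract.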

