LGMLDec 10, 2019

Magnitude and Uncertainty Pruning Criterion for Neural Networks

arXiv:1912.04845v12 citations
AI Analysis

This work addresses efficiency issues in neural networks for practitioners, but it is incremental as it builds on existing pruning methods.

The paper tackles the problem of overparameterization in neural networks, which causes computational and memory inefficiency and overfitting, by introducing a magnitude and uncertainty pruning criterion that achieves more compressed models with less loss in predictive power compared to magnitude-based pruning alone.

Neural networks have achieved dramatic improvements in recent years and depict the state-of-the-art methods for many real-world tasks nowadays. One drawback is, however, that many of these models are overparameterized, which makes them both computationally and memory intensive. Furthermore, overparameterization can also lead to undesired overfitting side-effects. Inspired by recently proposed magnitude-based pruning schemes and the Wald test from the field of statistics, we introduce a novel magnitude and uncertainty (M&U) pruning criterion that helps to lessen such shortcomings. One important advantage of our M&U pruning criterion is that it is scale-invariant, a phenomenon that the magnitude-based pruning criterion suffers from. In addition, we present a ``pseudo bootstrap'' scheme, which can efficiently estimate the uncertainty of the weights by using their update information during training. Our experimental evaluation, which is based on various neural network architectures and datasets, shows that our new criterion leads to more compressed models compared to models that are solely based on magnitude-based pruning criteria, with, at the same time, less loss in predictive power.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes