LG · Oct 15, 2020

Layer-adaptive sparsity for the Magnitude-based Pruning

Jaeho Lee, Sejun Park, Sangwoo Mo, Sungsoo Ahn, Jinwoo Shin

arXiv:2010.07611v2 · 27.8 · 347 citations · h-index: 39 · Has Code

Originality: Highly original

AI Analysis

This paper addresses the challenge of choosing per-layer sparsity when pruning deep networks, offering practitioners a practical alternative to handcrafted heuristics.

The paper tackles the problem of selecting layerwise sparsity in neural network pruning by proposing a novel importance score called LAMP, which consistently outperforms existing schemes in image classification setups, achieving state-of-the-art tradeoffs without hyperparameter tuning.

Recent discoveries on neural network pruning reveal that, with a carefully chosen layerwise sparsity, a simple magnitude-based pruning achieves a state-of-the-art tradeoff between sparsity and performance. However, without a clear consensus on "how to choose," the layerwise sparsities are mostly selected algorithm-by-algorithm, often resorting to handcrafted heuristics or an extensive hyperparameter search. To fill this gap, we propose a novel importance score for global pruning, coined layer-adaptive magnitude-based pruning (LAMP) score; the score is a rescaled version of weight magnitude that incorporates the model-level $\ell_2$ distortion incurred by pruning, and does not require any hyperparameter tuning or heavy computation. Under various image classification setups, LAMP consistently outperforms popular existing schemes for layerwise sparsity selection. Furthermore, we observe that LAMP continues to outperform baselines even in weight-rewinding setups, while the connectivity-oriented layerwise sparsity (the strongest baseline overall) performs worse than a simple global magnitude-based pruning in this case. Code: https://github.com/jaeho-lee/layer-adaptive-sparsity
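
The abstract describes the LAMP score only as a rescaled weight magnitude. As a rough illustration, the sketch below assumes the form given in the paper itself (not stated on this page): within one layer, a weight's score is its squared magnitude divided by the sum of squared magnitudes of all weights in that layer that are at least as large. Weights are then pruned globally by score. The function names `lamp_scores` and `global_prune` and the toy layers are hypothetical, and this is not the authors' implementation; see the linked repository for that.

```python
from typing import Dict, List


def lamp_scores(weights: List[float]) -> List[float]:
    """Hypothetical helper: LAMP-style scores for one layer's flattened weights.

    Assumed form: score(w) = w^2 / (sum of v^2 over weights v in the same
    layer whose magnitude is at least that of w).
    """
    # Indices sorted by ascending magnitude.
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    scores = [0.0] * len(weights)
    tail = 0.0  # running sum of squares, accumulated from the largest weight down
    for i in reversed(order):
        sq = weights[i] ** 2
        tail += sq
        scores[i] = sq / tail if tail > 0.0 else 0.0
    return scores


def global_prune(layers: Dict[str, List[float]], sparsity: float) -> Dict[str, List[float]]:
    """Zero out the `sparsity` fraction of weights with the lowest scores, across all layers."""
    scored = [
        (score, name, i)
        for name, w in layers.items()
        for i, score in enumerate(lamp_scores(w))
    ]
    scored.sort(key=lambda t: t[0])  # lowest scores first
    n_prune = int(sparsity * len(scored))
    pruned = {name: list(w) for name, w in layers.items()}
    for _, name, i in scored[:n_prune]:
        pruned[name][i] = 0.0
    return pruned


if __name__ == "__main__":
    toy = {"a": [0.1, 0.2], "b": [5.0, 6.0, 7.0]}
    print(global_prune(toy, 0.4))
```

On the toy input, plain global magnitude pruning at 40% sparsity would remove both weights of layer `a`, since they are the two smallest overall. Under the assumed score, the largest weight in each layer always scores 1, so the sketch instead removes one weight from each layer and no layer is emptied.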
