LG · ML · Jun 16, 2020

Directional Pruning of Deep Neural Networks

arXiv:2006.09358v2 · 36 citations · Has Code
AI Analysis

This work addresses the challenge of efficiently compressing neural networks for deployment, though it is incremental as it builds on existing pruning and optimization techniques.

The paper proposes a directional pruning method for deep neural networks that finds sparse minimizers in flat regions of the training loss without retraining or expert knowledge of the sparsity level. It reaches 92% sparsity on ResNet50 with ImageNet while using only slightly more wall time and memory than SGD.

Given that stochastic gradient descent (SGD) often finds a flat minimum valley in the training loss, we propose a novel directional pruning method that searches for a sparse minimizer in or close to that flat region. The proposed method requires neither retraining nor expert knowledge of the sparsity level. Because estimating the flat directions is computationally prohibitive, we propose a carefully tuned $\ell_1$ proximal gradient algorithm that provably achieves directional pruning with a small learning rate after sufficient training. Empirically, our solution shows promising results in the highly sparse regime (92% sparsity) among many existing pruning methods on ResNet50 with ImageNet, while using only slightly more wall time and memory than SGD. Using VGG16 and the wide ResNet 28x10 on CIFAR-10 and CIFAR-100, we demonstrate that our solution reaches the same minima valley as SGD, and that the minima found by our solution and by SGD do not deviate in directions that impact the training loss. The code that reproduces the results of this paper is available at https://github.com/donlan2710/gRDA-Optimizer/tree/master/directional_pruning.
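To make the idea of an $\ell_1$ proximal gradient step concrete, here is a minimal sketch of textbook proximal gradient descent (ISTA) on a toy sparse regression problem. It is not the authors' tuned algorithm, and it does not reproduce their schedule or their results. The function names, the toy data, and the values of `lam`, `lr`, and `steps` are all assumptions chosen for illustration.

```python
import random


def soft_threshold(x, t):
    """Proximal operator of t * |x|: shrink x toward zero by t."""
    if x > t:
        return x - t
    if x < -t:
        return x + t
    return 0.0


def prox_gradient_l1(X, y, lam, lr, steps):
    """Minimize (1/2n) * ||Xw - y||^2 + lam * ||w||_1 by proximal gradient."""
    n, d = len(X), len(X[0])
    w = [0.0] * d
    for _ in range(steps):
        # Residuals of the current linear fit.
        r = [sum(X[i][j] * w[j] for j in range(d)) - y[i] for i in range(n)]
        # Gradient of the smooth least-squares term.
        g = [sum(X[i][j] * r[i] for i in range(n)) / n for j in range(d)]
        # Gradient step followed by soft-thresholding, which sets small
        # coordinates exactly to zero (this is where sparsity comes from).
        w = [soft_threshold(w[j] - lr * g[j], lr * lam) for j in range(d)]
    return w


def sparsity(w):
    """Fraction of coordinates that are exactly zero."""
    return sum(1 for v in w if v == 0.0) / len(w)


# Hypothetical toy data: only the first two of ten features matter.
rng = random.Random(0)
true_w = [2.0, -3.0] + [0.0] * 8
X = [[rng.gauss(0, 1) for _ in range(10)] for _ in range(200)]
y = [sum(a * b for a, b in zip(row, true_w)) + 0.01 * rng.gauss(0, 1)
     for row in X]

w = prox_gradient_l1(X, y, lam=0.1, lr=0.1, steps=500)
print("weights:", [round(v, 3) for v in w])
print("sparsity:", sparsity(w))
```

The soft-thresholding step is what distinguishes this from plain gradient descent: coordinates whose update stays within `lr * lam` of zero are set exactly to zero, so no separate magnitude-pruning pass or preset sparsity level is needed in this toy setting.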
