NECVLGAug 16, 2016

Dynamic Network Surgery for Efficient DNNs

arXiv:1608.04493v21152 citationsHas Code
AI Analysis

This addresses the challenge of computational efficiency for mobile deployment, offering a novel approach to network compression.

The paper tackles the problem of deploying deep neural networks on mobile platforms by proposing dynamic network surgery, a compression method that reduces parameters in LeNet-5 by 108x and AlexNet by 17.7x without accuracy loss.

Deep learning has become a ubiquitous technology to improve machine intelligence. However, most of the existing deep models are structurally very complex, making them difficult to be deployed on the mobile platforms with limited computational power. In this paper, we propose a novel network compression method called dynamic network surgery, which can remarkably reduce the network complexity by making on-the-fly connection pruning. Unlike the previous methods which accomplish this task in a greedy way, we properly incorporate connection splicing into the whole process to avoid incorrect pruning and make it as a continual network maintenance. The effectiveness of our method is proved with experiments. Without any accuracy loss, our method can efficiently compress the number of parameters in LeNet-5 and AlexNet by a factor of $\bm{108}\times$ and $\bm{17.7}\times$ respectively, proving that it outperforms the recent pruning method by considerable margins. Code and some models are available at https://github.com/yiwenguo/Dynamic-Network-Surgery.

Code Implementations3 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes