LGCVMLJun 10, 2019

Network Implosion: Effective Model Compression for ResNets via Static Layer Pruning and Retraining

arXiv:1906.03826v11 citations
Originality Incremental advance
AI Analysis

This addresses efficiency for deep learning practitioners by offering a model compression technique that is incremental but provides specific gains.

The paper tackles the high computational cost of Residual Networks by proposing Network Implosion, a method that prunes and retrains layers to reduce the number of layers by 24.00 to 42.86 percent without accuracy drop on Cifar-10/100 and ImageNet.

Residual Networks with convolutional layers are widely used in the field of machine learning. Since they effectively extract features from input data by stacking multiple layers, they can achieve high accuracy in many applications. However, the stacking of many layers raises their computation costs. To address this problem, we propose Network Implosion, it erases multiple layers from Residual Networks without degrading accuracy. Our key idea is to introduce a priority term that identifies the importance of a layer; we can select unimportant layers according to the priority and erase them after the training. In addition, we retrain the networks to avoid critical drops in accuracy after layer erasure. A theoretical assessment reveals that our erasure and retraining scheme can erase layers without accuracy drop, and achieve higher accuracy than is possible with training from scratch. Our experiments show that Network Implosion can, for classification on Cifar-10/100 and ImageNet, reduce the number of layers by 24.00 to 42.86 percent without any drop in accuracy.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes