CVAug 22, 2018

Asymptotic Soft Filter Pruning for Deep Convolutional Neural Networks

Yang He, Xuanyi Dong, Guoliang Kang, Yanwei Fu, Chenggang Yan, Yi Yang

arXiv:1808.07471v416.5146 citations

Originality Incremental advance

AI Analysis

This work addresses the computational cost of over-parameterized CNNs for image classification, presenting an incremental improvement over existing pruning methods.

The paper tackles the problem of accelerating deep convolutional neural networks by proposing an Asymptotic Soft Filter Pruning (ASFP) method that updates pruned filters during retraining and prunes asymptotically to reduce information loss. On ILSVRC-2012, ASFP reduces over 40% FLOPs on ResNet-50 with only 0.14% top-5 accuracy degradation, outperforming soft filter pruning by 8%.

Deeper and wider Convolutional Neural Networks (CNNs) achieve superior performance but bring expensive computation cost. Accelerating such over-parameterized neural network has received increased attention. A typical pruning algorithm is a three-stage pipeline, i.e., training, pruning, and retraining. Prevailing approaches fix the pruned filters to zero during retraining, and thus significantly reduce the optimization space. Besides, they directly prune a large number of filters at first, which would cause unrecoverable information loss. To solve these problems, we propose an Asymptotic Soft Filter Pruning (ASFP) method to accelerate the inference procedure of the deep neural networks. First, we update the pruned filters during the retraining stage. As a result, the optimization space of the pruned model would not be reduced but be the same as that of the original model. In this way, the model has enough capacity to learn from the training data. Second, we prune the network asymptotically. We prune few filters at first and asymptotically prune more filters during the training procedure. With asymptotic pruning, the information of the training set would be gradually concentrated in the remaining filters, so the subsequent training and pruning process would be stable. Experiments show the effectiveness of our ASFP on image classification benchmarks. Notably, on ILSVRC-2012, our ASFP reduces more than 40% FLOPs on ResNet-50 with only 0.14% top-5 accuracy degradation, which is higher than the soft filter pruning (SFP) by 8%.

View on arXiv PDF

Similar