CVOct 28, 2018

Discrimination-aware Channel Pruning for Deep Neural Networks

Zhuangwei Zhuang, Mingkui Tan, Bohan Zhuang, Jing Liu, Yong Guo, Qingyao Wu, Junzhou Huang, Jinhui Zhu

arXiv:1810.11809v338.3635 citationsHas Code

Originality Incremental advance

AI Analysis

This work addresses the challenge of model compression for deep learning practitioners by improving pruning efficiency and accuracy, though it is incremental as it builds on existing pruning strategies.

The paper tackles the problem of channel pruning in deep neural networks by introducing a discrimination-aware method that selects channels based on discriminative power and reconstruction error, resulting in a pruned ResNet-50 with 30% channel reduction that outperforms the original model by 0.39% in top-1 accuracy on ILSVRC-12.

Channel pruning is one of the predominant approaches for deep model compression. Existing pruning methods either train from scratch with sparsity constraints on channels, or minimize the reconstruction error between the pre-trained feature maps and the compressed ones. Both strategies suffer from some limitations: the former kind is computationally expensive and difficult to converge, whilst the latter kind optimizes the reconstruction error but ignores the discriminative power of channels. To overcome these drawbacks, we investigate a simple-yet-effective method, called discrimination-aware channel pruning, to choose those channels that really contribute to discriminative power. To this end, we introduce additional losses into the network to increase the discriminative power of intermediate layers and then select the most discriminative channels for each layer by considering the additional loss and the reconstruction error. Last, we propose a greedy algorithm to conduct channel selection and parameter optimization in an iterative way. Extensive experiments demonstrate the effectiveness of our method. For example, on ILSVRC-12, our pruned ResNet-50 with 30% reduction of channels even outperforms the original model by 0.39% in top-1 accuracy.

View on arXiv PDF Code

Similar