LGMLNov 19, 2016

Pruning Convolutional Neural Networks for Resource Efficient Inference

arXiv:1611.06440v2436 citations
Originality Incremental advance
AI Analysis

This work addresses resource efficiency for deep learning practitioners by providing an incremental improvement in pruning methods for transfer learning scenarios.

The authors tackled the problem of pruning convolutional neural networks for efficient inference by proposing a Taylor expansion-based criterion, achieving over 10x theoretical reduction in filters with minimal accuracy drop in tasks like Birds-200 and Flowers-102.

We propose a new formulation for pruning convolutional kernels in neural networks to enable efficient inference. We interleave greedy criteria-based pruning with fine-tuning by backpropagation - a computationally efficient procedure that maintains good generalization in the pruned network. We propose a new criterion based on Taylor expansion that approximates the change in the cost function induced by pruning network parameters. We focus on transfer learning, where large pretrained networks are adapted to specialized tasks. The proposed criterion demonstrates superior performance compared to other criteria, e.g. the norm of kernel weights or feature map activation, for pruning large CNNs after adaptation to fine-grained classification tasks (Birds-200 and Flowers-102) relaying only on the first order gradient information. We also show that pruning can lead to more than 10x theoretical (5x practical) reduction in adapted 3D-convolutional filters with a small drop in accuracy in a recurrent gesture classifier. Finally, we show results for the large-scale ImageNet dataset to emphasize the flexibility of our approach.

Code Implementations8 repos

Data from Papers with Code (CC-BY-SA-4.0)

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes