CVLGAug 31, 2016

Pruning Filters for Efficient ConvNets

arXiv:1608.08710v34090 citations
Originality Incremental advance
AI Analysis

This work addresses efficiency issues for deploying CNNs in resource-constrained environments, offering a practical acceleration method without requiring sparse libraries.

The paper tackles the problem of high computation and storage costs in CNNs by pruning filters with small effects on accuracy, reducing inference costs by up to 34% for VGG-16 and 38% for ResNet-110 on CIFAR10 while maintaining near-original accuracy.

The success of CNNs in various applications is accompanied by a significant increase in the computation and parameter storage costs. Recent efforts toward reducing these overheads involve pruning and compressing the weights of various layers without hurting original accuracy. However, magnitude-based pruning of weights reduces a significant number of parameters from the fully connected layers and may not adequately reduce the computation costs in the convolutional layers due to irregular sparsity in the pruned networks. We present an acceleration method for CNNs, where we prune filters from CNNs that are identified as having a small effect on the output accuracy. By removing whole filters in the network together with their connecting feature maps, the computation costs are reduced significantly. In contrast to pruning weights, this approach does not result in sparse connectivity patterns. Hence, it does not need the support of sparse convolution libraries and can work with existing efficient BLAS libraries for dense matrix multiplications. We show that even simple filter pruning techniques can reduce inference costs for VGG-16 by up to 34% and ResNet-110 by up to 38% on CIFAR10 while regaining close to the original accuracy by retraining the networks.

Code Implementations21 repos

Data from Papers with Code (CC-BY-SA-4.0)

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes