CV · Apr 25, 2022

Boosting Pruned Networks with Linear Over-parameterization

arXiv:2204.11444v3 · 5 citations · h-index: 69
Originality: Incremental advance
AI Analysis

This paper addresses accuracy loss in networks pruned for efficient inference. It offers a domain-specific solution and is an incremental advance.

The paper tackles the challenge of restoring accuracy in pruned neural networks through fine-tuning. It proposes a method that linearly over-parameterizes the compact layers to increase the number of fine-tuning parameters, then re-parameterizes them back into the original layers after training. The reported result is a significant improvement over vanilla fine-tuning on CIFAR-10 and ImageNet, especially at high pruning ratios.
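The expand-then-merge idea can be illustrated on a single bias-free linear layer. The sketch below is only an illustration under stated assumptions: the function names (`expand_linear`, `merge_linear`) and the initialization (an identity block stacked on zero rows) are hypothetical choices made here, not the initialization used in the paper. It shows that a weight matrix can be factored into two consecutive linear maps without changing the output, and that the two maps collapse back into one matrix of the original shape.

```python
import random


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def forward(w, x):
    """Apply a bias-free linear layer with weight w (out x in) to vector x."""
    return [sum(wi * xi for wi, xi in zip(row, x)) for row in w]


def expand_linear(w, hidden, seed=0):
    """Factor w (out x in) into w1 (hidden x in) and w2 (out x hidden) with w2 @ w1 == w.

    Assumed initialization (hypothetical, not from the paper): w1 is an
    identity block stacked on zero rows, and w2 is w followed by small random
    columns. The zero rows of w1 cancel the random columns, so the product
    equals w at the start of fine-tuning.
    """
    n_in = len(w[0])
    if hidden < n_in:
        raise ValueError("hidden width must be at least the input width")
    rng = random.Random(seed)
    w1 = [[1.0 if r == c else 0.0 for c in range(n_in)] for r in range(hidden)]
    w2 = [list(row) + [rng.uniform(-0.1, 0.1) for _ in range(hidden - n_in)] for row in w]
    return w1, w2


def merge_linear(w1, w2):
    """Collapse the two consecutive linear layers back into one weight matrix."""
    return matmul(w2, w1)


# Example: a 3 -> 2 layer expanded through a hidden width of 5.
w = [[0.5, -1.0, 2.0], [1.5, 0.25, -0.75]]
x = [1.0, 2.0, 3.0]
w1, w2 = expand_linear(w, hidden=5)
y_original = forward(w, x)
y_expanded = forward(w2, forward(w1, x))  # same output, more trainable weights
w_merged = merge_linear(w1, w2)           # same shape as w
```

Because no nonlinearity sits between the two maps, the merge stays exact after `w1` and `w2` have been updated by fine-tuning, so the deployed layer keeps the pruned layer's size. The same argument applies to convolutions, where merging composes kernels instead of multiplying matrices.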

Structured pruning compresses neural networks by reducing channels (filters) for fast inference and low footprint at run-time. To restore accuracy after pruning, fine-tuning is usually applied to pruned networks. However, the small number of parameters remaining in a pruned network makes it hard for fine-tuning to restore accuracy. To address this challenge, we propose a novel method that first linearly over-parameterizes the compact layers in pruned networks to enlarge the number of fine-tuning parameters and then re-parameterizes them to the original layers after fine-tuning. Specifically, we expand each convolution/linear layer into several consecutive convolution/linear layers in an equivalent way that does not alter the current output feature maps. Furthermore, we utilize similarity-preserving knowledge distillation that encourages the over-parameterized block to learn the immediate data-to-data similarities of the corresponding dense layer to maintain its feature learning ability. The proposed method is comprehensively evaluated on CIFAR-10 and ImageNet, where it significantly outperforms the vanilla fine-tuning strategy, especially at large pruning ratios.
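The data-to-data similarity term mentioned in the abstract can be sketched as follows. This is a minimal sketch assuming the usual formulation of similarity-preserving distillation: a batch Gram matrix is computed from the flattened activations, each row is L2-normalized, and the loss is the squared Frobenius distance between the teacher and student matrices divided by the squared batch size. The function names are hypothetical, and the exact loss and weighting used in the paper are not given on this page.

```python
import math


def pairwise_similarity(acts):
    """Return the row-normalized b x b Gram matrix of a batch of flattened activations."""
    gram = [[sum(p * q for p, q in zip(a, b)) for b in acts] for a in acts]
    normalized = []
    for row in gram:
        norm = math.sqrt(sum(v * v for v in row))
        # Guard against an all-zero row to avoid dividing by zero.
        normalized.append([v / norm if norm > 0 else 0.0 for v in row])
    return normalized


def sp_loss(teacher_acts, student_acts):
    """Squared Frobenius distance between the two similarity matrices, divided by b^2."""
    b = len(teacher_acts)
    if b != len(student_acts):
        raise ValueError("teacher and student must see the same batch")
    g_t = pairwise_similarity(teacher_acts)
    g_s = pairwise_similarity(student_acts)
    total = sum((t - s) ** 2 for rt, rs in zip(g_t, g_s) for t, s in zip(rt, rs))
    return total / (b * b)


# Example: the dense layer (teacher) has 4 features per sample, the pruned
# block (student) only 2; the similarity matrices are both 2 x 2.
teacher = [[1.0, 0.0, 0.0, 0.0], [0.0, 1.0, 0.0, 0.0]]
student = [[1.0, 1.0], [1.0, 1.0]]
loss = sp_loss(teacher, student)
```

Because the comparison happens between b x b similarity matrices, the dense layer and the pruned block do not need the same number of channels, which is what makes this kind of term usable between a dense teacher layer and a compact student block.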

