LG AI CV NEJul 25, 2022

Trainability Preserving Neural Pruning

arXiv:2207.12534v314.129 citationsh-index: 17Has Code

Originality Incremental advance

AI Analysis

This addresses the issue of maintaining model performance after pruning for practitioners in deep learning, though it is incremental as it builds on existing pruning methods by focusing on trainability preservation.

The paper tackles the problem of neural network pruning degrading trainability, which can cause poor performance and sensitivity to hyperparameters, by introducing trainability preserving pruning (TPP) that penalizes filter correlations and regularizes batch normalization, resulting in improved pruning performance and robustness, with empirical gains on datasets like CIFAR10/100 and ImageNet-1K.

Many recent works have shown trainability plays a central role in neural network pruning -- unattended broken trainability can lead to severe under-performance and unintentionally amplify the effect of retraining learning rate, resulting in biased (or even misinterpreted) benchmark results. This paper introduces trainability preserving pruning (TPP), a scalable method to preserve network trainability against pruning, aiming for improved pruning performance and being more robust to retraining hyper-parameters (e.g., learning rate). Specifically, we propose to penalize the gram matrix of convolutional filters to decorrelate the pruned filters from the retained filters. In addition to the convolutional layers, per the spirit of preserving the trainability of the whole network, we also propose to regularize the batch normalization parameters (scale and bias). Empirical studies on linear MLP networks show that TPP can perform on par with the oracle trainability recovery scheme. On nonlinear ConvNets (ResNet56/VGG19) on CIFAR10/100, TPP outperforms the other counterpart approaches by an obvious margin. Moreover, results on ImageNet-1K with ResNets suggest that TPP consistently performs more favorably against other top-performing structured pruning approaches. Code: https://github.com/MingSun-Tse/TPP.

View on arXiv PDF Code

Similar