CVOct 10, 2020

Accelerate CNNs from Three Dimensions: A Comprehensive Pruning Framework

arXiv:2010.04879v365 citations
Originality Incremental advance
AI Analysis

This addresses the issue of accuracy degradation in pruned neural networks for researchers and practitioners in deep learning, offering an incremental improvement over existing one-dimensional pruning methods.

The paper tackles the problem of excessive accuracy loss in neural network pruning by proposing a comprehensive framework that prunes along three dimensions (depth, width, and resolution) simultaneously, formulating it as an optimization problem with polynomial regression and iterative fine-tuning to reduce data collection costs. The results show that the algorithm surpasses state-of-the-art pruning and neural architecture search-based algorithms in experiments.

Most neural network pruning methods, such as filter-level and layer-level prunings, prune the network model along one dimension (depth, width, or resolution) solely to meet a computational budget. However, such a pruning policy often leads to excessive reduction of that dimension, thus inducing a huge accuracy loss. To alleviate this issue, we argue that pruning should be conducted along three dimensions comprehensively. For this purpose, our pruning framework formulates pruning as an optimization problem. Specifically, it first casts the relationships between a certain model's accuracy and depth/width/resolution into a polynomial regression and then maximizes the polynomial to acquire the optimal values for the three dimensions. Finally, the model is pruned along the three optimal dimensions accordingly. In this framework, since collecting too much data for training the regression is very time-costly, we propose two approaches to lower the cost: 1) specializing the polynomial to ensure an accurate regression even with less training data; 2) employing iterative pruning and fine-tuning to collect the data faster. Extensive experiments show that our proposed algorithm surpasses state-of-the-art pruning algorithms and even neural architecture search-based algorithms.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes