LGAug 8, 2024

Confident magnitude-based neural network pruning

arXiv:2408.04759v13 citationsh-index: 2
Originality Incremental advance
AI Analysis

This work addresses the need for reliable and safe deployment of sparse neural networks, particularly in computer vision, by introducing uncertainty-aware pruning methods.

The paper tackled the problem of providing rigorous uncertainty quantification for neural network pruning, achieving finite-sample statistical guarantees while maintaining high performance in computer vision tasks.

Pruning neural networks has proven to be a successful approach to increase the efficiency and reduce the memory storage of deep learning models without compromising performance. Previous literature has shown that it is possible to achieve a sizable reduction in the number of parameters of a deep neural network without deteriorating its predictive capacity in one-shot pruning regimes. Our work builds beyond this background in order to provide rigorous uncertainty quantification for pruning neural networks reliably, which has not been addressed to a great extent in previous literature focusing on pruning methods in computer vision settings. We leverage recent techniques on distribution-free uncertainty quantification to provide finite-sample statistical guarantees to compress deep neural networks, while maintaining high performance. Moreover, this work presents experiments in computer vision tasks to illustrate how uncertainty-aware pruning is a useful approach to deploy sparse neural networks safely.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes