LGOct 31, 2024

Mutual Information Preserving Neural Network Pruning

arXiv:2411.00147v25 citationsh-index: 10
Originality Highly original
AI Analysis

This addresses the resource efficiency challenge for deploying large neural networks, offering a novel approach with proven re-trainability, though it is incremental as it builds on existing pruning frameworks.

The paper tackles the problem of neural network pruning by introducing Mutual Information Preserving Pruning (MIPP), a structured activation-based technique that conserves mutual information between layers to ensure re-trainability, and demonstrates that it consistently outperforms state-of-the-art methods in both pre- and post-training scenarios.

Pruning has emerged as the primary approach used to limit the resource requirements of large neural networks (NNs). Since the proposal of the lottery ticket hypothesis, researchers have focused either on pruning at initialization or after training. However, recent theoretical findings have shown that the sample efficiency of robust pruned models is proportional to the mutual information (MI) between the pruning masks and the model's training datasets, \textit{whether at initialization or after training}. In this paper, starting from these results, we introduce Mutual Information Preserving Pruning (MIPP), a structured activation-based pruning technique applicable before or after training. The core principle of MIPP is to select nodes in a way that conserves MI shared between the activations of adjacent layers, and consequently between the data and masks. Approaching the pruning problem in this manner means we can prove that there exists a function that can map the pruned upstream layer's activations to the downstream layer's, implying re-trainability. We demonstrate that MIPP consistently outperforms state-of-the-art methods, regardless of whether pruning is performed before or after training.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes