LGNov 27, 2024

Pruning Deep Convolutional Neural Network Using Conditional Mutual Information

arXiv:2411.18578v1
Originality Incremental advance
AI Analysis

This work addresses model compression for efficient deployment in image classification, representing an incremental improvement over existing pruning techniques.

The paper tackles the problem of deploying large convolutional neural networks on resource-limited hardware by proposing a structured filter-pruning method based on conditional mutual information, which reduces the number of filters in VGG16 by over a third with only a 0.32% drop in accuracy on CIFAR-10.

Convolutional Neural Networks (CNNs) achieve high performance in image classification tasks but are challenging to deploy on resource-limited hardware due to their large model sizes. To address this issue, we leverage Mutual Information, a metric that provides valuable insights into how deep learning models retain and process information through measuring the shared information between input features or output labels and network layers. In this study, we propose a structured filter-pruning approach for CNNs that identifies and selectively retains the most informative features in each layer. Our approach successively evaluates each layer by ranking the importance of its feature maps based on Conditional Mutual Information (CMI) values, computed using a matrix-based Renyi α-order entropy numerical method. We propose several formulations of CMI to capture correlation among features across different layers. We then develop various strategies to determine the cutoff point for CMI values to prune unimportant features. This approach allows parallel pruning in both forward and backward directions and significantly reduces model size while preserving accuracy. Tested on the VGG16 architecture with the CIFAR-10 dataset, the proposed method reduces the number of filters by more than a third, with only a 0.32% drop in test accuracy.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes