LGMLJun 17, 2018

Fast Convex Pruning of Deep Neural Networks

arXiv:1806.06457v226 citations
Originality Incremental advance
AI Analysis

This work addresses network compression for efficient deployment, but it is incremental as it builds on existing convex optimization and sparse regression methods.

The paper tackles the problem of simplifying trained neural networks by developing Net-Trim, a convex pruning technique that sparsifies networks layer by layer while preserving internal responses, with theoretical guarantees requiring O(s log N/s) samples to find weights using at most s nonzero terms.

We develop a fast, tractable technique called Net-Trim for simplifying a trained neural network. The method is a convex post-processing module, which prunes (sparsifies) a trained network layer by layer, while preserving the internal responses. We present a comprehensive analysis of Net-Trim from both the algorithmic and sample complexity standpoints, centered on a fast, scalable convex optimization program. Our analysis includes consistency results between the initial and retrained models before and after Net-Trim application and guarantees on the number of training samples needed to discover a network that can be expressed using a certain number of nonzero terms. Specifically, if there is a set of weights that uses at most $s$ terms that can re-create the layer outputs from the layer inputs, we can find these weights from $\mathcal{O}(s\log N/s)$ samples, where $N$ is the input size. These theoretical results are similar to those for sparse regression using the Lasso, and our analysis uses some of the same recently-developed tools (namely recent results on the concentration of measure and convex analysis). Finally, we propose an algorithmic framework based on the alternating direction method of multipliers (ADMM), which allows a fast and simple implementation of Net-Trim for network pruning and compression.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes