LGCVMLMay 14, 2020

Bayesian Bits: Unifying Quantization and Pruning

arXiv:2005.07093v3141 citations
Originality Incremental advance
AI Analysis

This work addresses the need for more efficient neural network deployment, particularly in resource-constrained environments, by providing a unified approach to quantization and pruning, though it is incremental as it builds on existing gradient-based optimization techniques.

The authors tackled the problem of jointly optimizing quantization and pruning in neural networks, resulting in a method that learns pruned, mixed precision networks with a better trade-off between accuracy and efficiency compared to static bit width equivalents.

We introduce Bayesian Bits, a practical method for joint mixed precision quantization and pruning through gradient based optimization. Bayesian Bits employs a novel decomposition of the quantization operation, which sequentially considers doubling the bit width. At each new bit width, the residual error between the full precision value and the previously rounded value is quantized. We then decide whether or not to add this quantized residual error for a higher effective bit width and lower quantization noise. By starting with a power-of-two bit width, this decomposition will always produce hardware-friendly configurations, and through an additional 0-bit option, serves as a unified view of pruning and quantization. Bayesian Bits then introduces learnable stochastic gates, which collectively control the bit width of the given tensor. As a result, we can obtain low bit solutions by performing approximate inference over the gates, with prior distributions that encourage most of them to be switched off. We experimentally validate our proposed method on several benchmark datasets and show that we can learn pruned, mixed precision networks that provide a better trade-off between accuracy and efficiency than their static bit width equivalents.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes