OC LGSep 8, 2020

Alternating Direction Method of Multipliers for Quantization

Tianjian Huang, Prajwal Singhania, Maziar Sanjabi, Pabitra Mitra, Meisam Razaviyayn

arXiv:2009.03482v24.713 citations

Originality Incremental advance

AI Analysis

This addresses quantization challenges for efficient model deployment, but it is incremental as it adapts an existing method to a specific domain.

The paper tackles the problem of quantizing machine learning model parameters by applying the Alternating Direction Method of Multipliers (ADMM) to discrete optimization, establishing convergence to stationary points and developing variants with improved performance.

Quantization of the parameters of machine learning models, such as deep neural networks, requires solving constrained optimization problems, where the constraint set is formed by the Cartesian product of many simple discrete sets. For such optimization problems, we study the performance of the Alternating Direction Method of Multipliers for Quantization ($\texttt{ADMM-Q}$) algorithm, which is a variant of the widely-used ADMM method applied to our discrete optimization problem. We establish the convergence of the iterates of $\texttt{ADMM-Q}$ to certain $\textit{stationary points}$. To the best of our knowledge, this is the first analysis of an ADMM-type method for problems with discrete variables/constraints. Based on our theoretical insights, we develop a few variants of $\texttt{ADMM-Q}$ that can handle inexact update rules, and have improved performance via the use of "soft projection" and "injecting randomness to the algorithm". We empirically evaluate the efficacy of our proposed approaches.

View on arXiv PDF

Similar