LG AIOct 13, 2024

Gradient-Free Training of Quantized Neural Networks

Noa Cohen, Omkar Joglekar, Dotan Di Castro, Vladimir Tchuiev, Shir Kozlovsky, Michal Moshkovitz

arXiv:2410.09734v24.61 citationsh-index: 6

Originality Highly original

AI Analysis

This addresses the energy and computational efficiency problem for deploying neural networks in resource-constrained environments, representing a paradigm shift rather than an incremental improvement.

The paper tackles the problem of computationally expensive gradient-based training for quantized neural networks by proposing a gradient-free optimization framework, achieving performance comparable to full-precision training while using up to 3x less energy and requiring up to 5x fewer parameter updates.

Training neural networks requires significant computational resources and energy. Methods like mixed-precision and quantization-aware training reduce bit usage, yet they still depend heavily on computationally expensive gradient-based optimization. In this work, we propose a paradigm shift: eliminate gradients altogether. One might hope that, in a finite quantized space, finding optimal weights with out gradients would be easier but we theoretically prove that this problem is NP-hard even in simple settings where the continuous case is efficiently solvable. To address this, we introduce a novel heuristic optimization framework that avoids full weight updates and significantly improves efficiency. Empirically, our method achieves performance comparable to that of full-precision gradient-based training on standard datasets and architectures, while using up to 3x less energy and requiring up to 5x fewer parameter updates.

View on arXiv PDF

Similar