LGAIOct 13, 2024

Gradient-Free Training of Quantized Neural Networks

arXiv:2410.09734v21 citationsh-index: 6
Originality Highly original
AI Analysis

This addresses the energy and computational efficiency problem for deploying neural networks in resource-constrained environments, representing a paradigm shift rather than an incremental improvement.

The paper tackles the problem of computationally expensive gradient-based training for quantized neural networks by proposing a gradient-free optimization framework, achieving performance comparable to full-precision training while using up to 3x less energy and requiring up to 5x fewer parameter updates.

Training neural networks requires significant computational resources and energy. Methods like mixed-precision and quantization-aware training reduce bit usage, yet they still depend heavily on computationally expensive gradient-based optimization. In this work, we propose a paradigm shift: eliminate gradients altogether. One might hope that, in a finite quantized space, finding optimal weights with out gradients would be easier but we theoretically prove that this problem is NP-hard even in simple settings where the continuous case is efficiently solvable. To address this, we introduce a novel heuristic optimization framework that avoids full weight updates and significantly improves efficiency. Empirically, our method achieves performance comparable to that of full-precision gradient-based training on standard datasets and architectures, while using up to 3x less energy and requiring up to 5x fewer parameter updates.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes