LG OCMay 23, 2025

Beyond Discreteness: Finite-Sample Analysis of Straight-Through Estimator for Quantization

arXiv:2505.18113v11 citationsh-index: 18

Originality Incremental advance

AI Analysis

This work addresses the theoretical gap in understanding STE for neural network quantization, which is crucial for efficient deep learning deployment, though it is incremental as it builds on existing heuristic use.

The paper tackles the problem of training quantized neural networks by providing the first finite-sample analysis of the straight-through estimator (STE), deriving a sample complexity bound that guarantees convergence to the global minimum for a two-layer network with binary weights and activations, and uncovering a recurrence property in the presence of label noise.

Training quantized neural networks requires addressing the non-differentiable and discrete nature of the underlying optimization problem. To tackle this challenge, the straight-through estimator (STE) has become the most widely adopted heuristic, allowing backpropagation through discrete operations by introducing surrogate gradients. However, its theoretical properties remain largely unexplored, with few existing works simplifying the analysis by assuming an infinite amount of training data. In contrast, this work presents the first finite-sample analysis of STE in the context of neural network quantization. Our theoretical results highlight the critical role of sample size in the success of STE, a key insight absent from existing studies. Specifically, by analyzing the quantization-aware training of a two-layer neural network with binary weights and activations, we derive the sample complexity bound in terms of the data dimensionality that guarantees the convergence of STE-based optimization to the global minimum. Moreover, in the presence of label noises, we uncover an intriguing recurrence property of STE-gradient method, where the iterate repeatedly escape from and return to the optimal binary weights. Our analysis leverages tools from compressed sensing and dynamical systems theory.

View on arXiv PDF

Similar