LG CVAug 27, 2025

Progressive Element-wise Gradient Estimation for Neural Network Quantization

arXiv:2509.00097v11 citationsh-index: 1

Originality Incremental advance

AI Analysis

This addresses the challenge of deploying deep neural networks on resource-constrained hardware by enhancing quantization-aware training, though it appears incremental as it builds on existing methods.

The paper tackles the problem of accuracy degradation in neural network quantization at low bit-widths by proposing Progressive Element-wise Gradient Estimation (PEGE), which replaces the Straight-Through Estimator and improves quantized model accuracy, achieving results that match or exceed full-precision models on datasets like CIFAR-10 and ImageNet.

Neural network quantization aims to reduce the bit-widths of weights and activations, making it a critical technique for deploying deep neural networks on resource-constrained hardware. Most Quantization-Aware Training (QAT) methods rely on the Straight-Through Estimator (STE) to address the non-differentiability of discretization functions by replacing their derivatives with that of the identity function. While effective, STE overlooks discretization errors between continuous and quantized values, which can lead to accuracy degradation -- especially at extremely low bit-widths. In this paper, we propose Progressive Element-wise Gradient Estimation (PEGE), a simple yet effective alternative to STE, which can be seamlessly integrated with any forward propagation methods and improves the quantized model accuracy. PEGE progressively replaces full-precision weights and activations with their quantized counterparts via a novel logarithmic curriculum-driven mixed-precision replacement strategy. Then it formulates QAT as a co-optimization problem that simultaneously minimizes the task loss for prediction and the discretization error for quantization, providing a unified and generalizable framework. Extensive experiments on CIFAR-10 and ImageNet across various architectures (e.g., ResNet, VGG) demonstrate that PEGE consistently outperforms existing backpropagation methods and enables low-precision models to match or even outperform the accuracy of their full-precision counterparts.

View on arXiv PDF

Similar