LG CR CVOct 31, 2024

ARQ: A Mixed-Precision Quantization Framework for Accurate and Certifiably Robust DNNs

Yuchen Yang, Shubham Ugare, Yifan Zhao, Gagandeep Singh, Sasa Misailovic

arXiv:2410.24214v22.63 citationsh-index: 28Has Code

Originality Highly original

AI Analysis

This addresses the challenge of efficient and robust deployment of DNNs for applications requiring security guarantees, representing a novel integration rather than an incremental improvement.

The paper tackles the problem of maintaining certified robustness in quantized deep neural networks, achieving performance comparable to floating-point models with only 1.5% instructions and the highest certified radius across benchmarks.

Mixed precision quantization has become an important technique for optimizing the execution of deep neural networks (DNNs). Certified robustness, which provides provable guarantees about a model's ability to withstand different adversarial perturbations, has rarely been addressed in quantization due to unacceptably high cost of certifying robustness. This paper introduces ARQ, an innovative mixed-precision quantization method that not only preserves the clean accuracy of the smoothed classifiers but also maintains their certified robustness. ARQ uses reinforcement learning to find accurate and robust DNN quantization, while efficiently leveraging randomized smoothing, a popular class of statistical DNN verification algorithms. ARQ consistently performs better than multiple state-of-the-art quantization techniques across all the benchmarks and the input perturbation levels. The performance of ARQ quantized networks reaches that of the original DNN with floating-point weights, but with only 1.5% instructions and the highest certified radius. ARQ code is available at https://anonymous.4open.science/r/ARQ-FE4B.

View on arXiv PDF

Similar