LG NEJun 23, 2023

QNNRepair: Quantized Neural Network Repair

Xidan Song, Youcheng Sun, Mustafa A. Mustafa, Lucas C. Cordeiro

arXiv:2306.13793v32.03 citationsh-index: 23

Originality Incremental advance

AI Analysis

This addresses the performance degradation issue in quantized neural networks for deployment in resource-constrained environments, representing a novel but incremental advancement.

The paper tackles the problem of repairing quantized neural networks (QNNs) to improve accuracy after quantization, resulting in repaired models that achieve 24% higher accuracy than a state-of-the-art method on the ImageNet dataset.

We present QNNRepair, the first method in the literature for repairing quantized neural networks (QNNs). QNNRepair aims to improve the accuracy of a neural network model after quantization. It accepts the full-precision and weight-quantized neural networks and a repair dataset of passing and failing tests. At first, QNNRepair applies a software fault localization method to identify the neurons that cause performance degradation during neural network quantization. Then, it formulates the repair problem into a linear programming problem of solving neuron weights parameters, which corrects the QNN's performance on failing tests while not compromising its performance on passing tests. We evaluate QNNRepair with widely used neural network architectures such as MobileNetV2, ResNet, and VGGNet on popular datasets, including high-resolution images. We also compare QNNRepair with the state-of-the-art data-free quantization method SQuant. According to the experiment results, we conclude that QNNRepair is effective in improving the quantized model's performance in most cases. Its repaired models have 24% higher accuracy than SQuant's in the independent validation set, especially for the ImageNet dataset.

View on arXiv PDF

Similar