LGNEJun 23, 2023

QNNRepair: Quantized Neural Network Repair

arXiv:2306.13793v33 citationsh-index: 23
Originality Incremental advance
AI Analysis

This addresses the performance degradation issue in quantized neural networks for deployment in resource-constrained environments, representing a novel but incremental advancement.

The paper tackles the problem of repairing quantized neural networks (QNNs) to improve accuracy after quantization, resulting in repaired models that achieve 24% higher accuracy than a state-of-the-art method on the ImageNet dataset.

We present QNNRepair, the first method in the literature for repairing quantized neural networks (QNNs). QNNRepair aims to improve the accuracy of a neural network model after quantization. It accepts the full-precision and weight-quantized neural networks and a repair dataset of passing and failing tests. At first, QNNRepair applies a software fault localization method to identify the neurons that cause performance degradation during neural network quantization. Then, it formulates the repair problem into a linear programming problem of solving neuron weights parameters, which corrects the QNN's performance on failing tests while not compromising its performance on passing tests. We evaluate QNNRepair with widely used neural network architectures such as MobileNetV2, ResNet, and VGGNet on popular datasets, including high-resolution images. We also compare QNNRepair with the state-of-the-art data-free quantization method SQuant. According to the experiment results, we conclude that QNNRepair is effective in improving the quantized model's performance in most cases. Its repaired models have 24% higher accuracy than SQuant's in the independent validation set, especially for the ImageNet dataset.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes