QUANT-PH CR LGFeb 18, 2024

Evaluating Efficacy of Model Stealing Attacks and Defenses on Quantum Neural Networks

Satwik Kundu, Debarshi Kundu, Swaroop Ghosh

arXiv:2402.11687v15.110 citationsh-index: 7GLSVLSI

Originality Incremental advance

AI Analysis

This addresses security vulnerabilities in cloud-hosted quantum machine learning models, but the defense results are incremental as they show limited effectiveness.

The study evaluated model stealing attacks on quantum neural networks, finding that attacks can achieve clone test accuracies up to 0.9× and 0.99× using different label strategies, and proposed defenses based on hardware noise that provided limited obfuscation but revealed inherent resilience in noisy-trained models.

Cloud hosting of quantum machine learning (QML) models exposes them to a range of vulnerabilities, the most significant of which is the model stealing attack. In this study, we assess the efficacy of such attacks in the realm of quantum computing. We conducted comprehensive experiments on various datasets with multiple QML model architectures. Our findings revealed that model stealing attacks can produce clone models achieving up to $0.9\times$ and $0.99\times$ clone test accuracy when trained using Top-$1$ and Top-$k$ labels, respectively ($k:$ num\_classes). To defend against these attacks, we leverage the unique properties of current noisy hardware and perturb the victim model outputs and hinder the attacker's training process. In particular, we propose: 1) hardware variation-induced perturbation (HVIP) and 2) hardware and architecture variation-induced perturbation (HAVIP). Although noise and architectural variability can provide up to $\sim16\%$ output obfuscation, our comprehensive analysis revealed that models cloned under noisy conditions tend to be resilient, suffering little to no performance degradation due to such obfuscations. Despite limited success with our defense techniques, this outcome has led to an important discovery: QML models trained on noisy hardwares are naturally resistant to perturbation or obfuscation-based defenses or attacks.

View on arXiv PDF

Similar