CRLGMay 12, 2020

Perturbing Inputs to Prevent Model Stealing

arXiv:2005.05823v17 citations
Originality Incremental advance
AI Analysis

This addresses security concerns for cloud-deployed ML models, offering a defense mechanism against model stealing, though it is incremental as it builds on existing noise-based protection methods.

The paper tackles the problem of protecting machine learning models from parameter stealing attacks by strategically perturbing inputs to cloud-based ML services, showing that even with infinite samples, attackers cannot recover true model parameters, and characterizes the trade-off between attacker estimation error and service output error.

We show how perturbing inputs to machine learning services (ML-service) deployed in the cloud can protect against model stealing attacks. In our formulation, there is an ML-service that receives inputs from users and returns the output of the model. There is an attacker that is interested in learning the parameters of the ML-service. We use the linear and logistic regression models to illustrate how strategically adding noise to the inputs fundamentally alters the attacker's estimation problem. We show that even with infinite samples, the attacker would not be able to recover the true model parameters. We focus on characterizing the trade-off between the error in the attacker's estimate of the parameters with the error in the ML-service's output.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes