ML LGNov 8, 2019

Theoretical Guarantees for Model Auditing with Finite Adversaries

Mario Diaz, Peter Kairouz, Jiachun Liao, Lalitha Sankar

arXiv:1911.03405v13.22 citations

Originality Synthesis-oriented

AI Analysis

This work addresses privacy verification for machine learning models, but it appears incremental as it builds on existing auditing techniques without introducing a new method.

The paper tackles the problem of ensuring that model auditing with finite adversaries can guarantee no stronger adversary can identify privacy violations, by analyzing parameters like adversary capacity and side information.

Privacy concerns have led to the development of privacy-preserving approaches for learning models from sensitive data. Yet, in practice, even models learned with privacy guarantees can inadvertently memorize unique training examples or leak sensitive features. To identify such privacy violations, existing model auditing techniques use finite adversaries defined as machine learning models with (a) access to some finite side information (e.g., a small auditing dataset), and (b) finite capacity (e.g., a fixed neural network architecture). Our work investigates the requirements under which an unsuccessful attempt to identify privacy violations by a finite adversary implies that no stronger adversary can succeed at such a task. We do so via parameters that quantify the capabilities of the finite adversary, including the size of the neural network employed by such an adversary and the amount of side information it has access to as well as the regularity of the (perhaps privacy-guaranteeing) audited model.

View on arXiv PDF

Similar