LGCRMLJan 28, 2019

Bayesian Differential Privacy for Machine Learning

arXiv:1901.09697v525 citations
AI Analysis

This addresses the problem of excessive accuracy loss in privacy-preserving machine learning for practitioners, though it is incremental as it builds upon existing differential privacy frameworks.

The authors tackled the mismatch between traditional differential privacy and machine learning by proposing Bayesian differential privacy, which incorporates data distribution to offer more practical privacy guarantees, resulting in significantly stronger privacy for in-distribution samples on datasets like MNIST and CIFAR-10 while maintaining high accuracy.

Traditional differential privacy is independent of the data distribution. However, this is not well-matched with the modern machine learning context, where models are trained on specific data. As a result, achieving meaningful privacy guarantees in ML often excessively reduces accuracy. We propose Bayesian differential privacy (BDP), which takes into account the data distribution to provide more practical privacy guarantees. We also derive a general privacy accounting method under BDP, building upon the well-known moments accountant. Our experiments demonstrate that in-distribution samples in classic machine learning datasets, such as MNIST and CIFAR-10, enjoy significantly stronger privacy guarantees than postulated by DP, while models maintain high classification accuracy.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes