LGCRDSMLAug 20, 2018

Privacy Amplification by Iteration

arXiv:1808.06651v2203 citations
Originality Highly original
AI Analysis

This work addresses privacy concerns in iterative machine learning algorithms, offering a new analysis technique that improves privacy guarantees in settings where existing methods like privacy amplification by sampling are not applicable.

The paper tackles the problem of analyzing differential privacy for iterative learning algorithms by showing that not releasing intermediate results amplifies privacy guarantees for contractive iterations, and demonstrates applications in convex optimization where a small number of non-private data points can reduce the gap between private and non-private methods.

Many commonly used learning algorithms work by iteratively updating an intermediate solution using one or a few data points in each iteration. Analysis of differential privacy for such algorithms often involves ensuring privacy of each step and then reasoning about the cumulative privacy cost of the algorithm. This is enabled by composition theorems for differential privacy that allow releasing of all the intermediate results. In this work, we demonstrate that for contractive iterations, not releasing the intermediate results strongly amplifies the privacy guarantees. We describe several applications of this new analysis technique to solving convex optimization problems via noisy stochastic gradient descent. For example, we demonstrate that a relatively small number of non-private data points from the same distribution can be used to close the gap between private and non-private convex optimization. In addition, we demonstrate that we can achieve guarantees similar to those obtainable using the privacy-amplification-by-sampling technique in several natural settings where that technique cannot be applied.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes