CR IT LG MLJul 9, 2022

The Poisson binomial mechanism for secure and private federated learning

Wei-Ning Chen, Ayfer Özgür, Peter Kairouz

arXiv:2207.09916v15.25 citationsh-index: 38

Originality Highly original

AI Analysis

This addresses privacy and efficiency challenges in federated learning for distributed systems, offering a novel discrete mechanism that outperforms prior additive noise schemes in high-privacy or low-communication settings.

The paper tackles the problem of secure and private federated learning by introducing the Poisson Binomial mechanism (PBM) for distributed mean estimation, achieving the same privacy-accuracy trade-offs as the continuous Gaussian mechanism while enabling unbiased estimation and reduced communication costs, with support decreasing as privacy increases.

We introduce the Poisson Binomial mechanism (PBM), a discrete differential privacy mechanism for distributed mean estimation (DME) with applications to federated learning and analytics. We provide a tight analysis of its privacy guarantees, showing that it achieves the same privacy-accuracy trade-offs as the continuous Gaussian mechanism. Our analysis is based on a novel bound on the Rényi divergence of two Poisson binomial distributions that may be of independent interest. Unlike previous discrete DP schemes based on additive noise, our mechanism encodes local information into a parameter of the binomial distribution, and hence the output distribution is discrete with bounded support. Moreover, the support does not increase as the privacy budget $\varepsilon \rightarrow 0$ as in the case of additive schemes which require the addition of more noise to achieve higher privacy; on the contrary, the support becomes smaller as $\varepsilon \rightarrow 0$. The bounded support enables us to combine our mechanism with secure aggregation (SecAgg), a multi-party cryptographic protocol, without the need of performing modular clipping which results in an unbiased estimator of the sum of the local vectors. This in turn allows us to apply it in the private FL setting and provide an upper bound on the convergence rate of the SGD algorithm. Moreover, since the support of the output distribution becomes smaller as $\varepsilon \rightarrow 0$, the communication cost of our scheme decreases with the privacy constraint $\varepsilon$, outperforming all previous distributed DP schemes based on additive noise in the high privacy or low communication regimes.

View on arXiv PDF

Similar