MLCRITLGSTMar 15, 2021

A Central Limit Theorem for Differentially Private Query Answering

arXiv:2103.08721v120 citations
Originality Incremental advance
AI Analysis

This work addresses a foundational problem in differential privacy for data analysts, providing theoretical insights into optimal noise mechanisms, though it is incremental by building on existing bounds.

The paper tackles the problem of optimizing noise distributions for differentially private query answering in high dimensions, proving a central limit theorem that shows mechanisms become approximately Gaussian under certain conditions and establishing an optimal trade-off where the Gaussian mechanism achieves sharp constant-factor optimality.

Perhaps the single most important use case for differential privacy is to privately answer numerical queries, which is usually achieved by adding noise to the answer vector. The central question, therefore, is to understand which noise distribution optimizes the privacy-accuracy trade-off, especially when the dimension of the answer vector is high. Accordingly, extensive literature has been dedicated to the question and the upper and lower bounds have been matched up to constant factors [BUV18, SU17]. In this paper, we take a novel approach to address this important optimality question. We first demonstrate an intriguing central limit theorem phenomenon in the high-dimensional regime. More precisely, we prove that a mechanism is approximately Gaussian Differentially Private [DRS21] if the added noise satisfies certain conditions. In particular, densities proportional to $\mathrm{e}^{-\|x\|_p^α}$, where $\|x\|_p$ is the standard $\ell_p$-norm, satisfies the conditions. Taking this perspective, we make use of the Cramer--Rao inequality and show an "uncertainty principle"-style result: the product of the privacy parameter and the $\ell_2$-loss of the mechanism is lower bounded by the dimension. Furthermore, the Gaussian mechanism achieves the constant-sharp optimal privacy-accuracy trade-off among all such noises. Our findings are corroborated by numerical experiments.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes