CRAILGMLOct 3, 2021

Dirichlet Mechanism for Differentially Private KL Divergence Minimization

arXiv:2110.01984v33 citations
Originality Incremental advance
AI Analysis

This work addresses privacy-preserving statistical estimation for data analysts, offering an incremental improvement over existing mechanisms.

The paper tackles the problem of minimizing KL divergence for sensitive data while ensuring differential privacy, proposing a Dirichlet mechanism-based Rényi differentially private algorithm that shows advantages over Gaussian and Laplace mechanisms in experiments on real-world datasets.

Given an empirical distribution $f(x)$ of sensitive data $x$, we consider the task of minimizing $F(y) = D_{\text{KL}} (f(x)\Vert y)$ over a probability simplex, while protecting the privacy of $x$. We observe that, if we take the exponential mechanism and use the KL divergence as the loss function, then the resulting algorithm is the Dirichlet mechanism that outputs a single draw from a Dirichlet distribution. Motivated by this, we propose a Rényi differentially private (RDP) algorithm that employs the Dirichlet mechanism to solve the KL divergence minimization task. In addition, given $f(x)$ as above and $\hat{y}$ an output of the Dirichlet mechanism, we prove a probability tail bound on $D_{\text{KL}} (f(x)\Vert \hat{y})$, which is then used to derive a lower bound for the sample complexity of our RDP algorithm. Experiments on real-world datasets demonstrate advantages of our algorithm over Gaussian and Laplace mechanisms in supervised classification and maximum likelihood estimation.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes