CR AI LG MLOct 3, 2021

Dirichlet Mechanism for Differentially Private KL Divergence Minimization

arXiv:2110.01984v33 citations

Originality Incremental advance

AI Analysis

This work addresses privacy-preserving statistical estimation for data analysts, offering an incremental improvement over existing mechanisms.

The paper tackles the problem of minimizing KL divergence for sensitive data while ensuring differential privacy, proposing a Dirichlet mechanism-based Rényi differentially private algorithm that shows advantages over Gaussian and Laplace mechanisms in experiments on real-world datasets.

Given an empirical distribution $f(x)$ of sensitive data $x$, we consider the task of minimizing $F(y) = D_{\text{KL}} (f(x)\Vert y)$ over a probability simplex, while protecting the privacy of $x$. We observe that, if we take the exponential mechanism and use the KL divergence as the loss function, then the resulting algorithm is the Dirichlet mechanism that outputs a single draw from a Dirichlet distribution. Motivated by this, we propose a Rényi differentially private (RDP) algorithm that employs the Dirichlet mechanism to solve the KL divergence minimization task. In addition, given $f(x)$ as above and $\hat{y}$ an output of the Dirichlet mechanism, we prove a probability tail bound on $D_{\text{KL}} (f(x)\Vert \hat{y})$, which is then used to derive a lower bound for the sample complexity of our RDP algorithm. Experiments on real-world datasets demonstrate advantages of our algorithm over Gaussian and Laplace mechanisms in supervised classification and maximum likelihood estimation.

View on arXiv PDF

Similar