DS LGJul 4, 2023

Fast Private Kernel Density Estimation via Locality Sensitive Quantization

arXiv:2307.01877v17.311 citationsh-index: 27Has Code

Originality Highly original

AI Analysis

This work addresses the challenge of scalable private data analysis for high-dimensional datasets, representing a significant advancement over prior methods.

The paper tackles the problem of efficiently approximating kernel density estimation under differential privacy, breaking the exponential time barrier in high dimensions and achieving linear time complexity, with experiments showing fast and accurate performance on large datasets.

We study efficient mechanisms for differentially private kernel density estimation (DP-KDE). Prior work for the Gaussian kernel described algorithms that run in time exponential in the number of dimensions $d$. This paper breaks the exponential barrier, and shows how the KDE can privately be approximated in time linear in $d$, making it feasible for high-dimensional data. We also present improved bounds for low-dimensional data. Our results are obtained through a general framework, which we term Locality Sensitive Quantization (LSQ), for constructing private KDE mechanisms where existing KDE approximation techniques can be applied. It lets us leverage several efficient non-private KDE methods -- like Random Fourier Features, the Fast Gauss Transform, and Locality Sensitive Hashing -- and ``privatize'' them in a black-box manner. Our experiments demonstrate that our resulting DP-KDE mechanisms are fast and accurate on large datasets in both high and low dimensions.

View on arXiv PDF Code

Similar