Private Statistical Estimation of Many Quantiles
This addresses the need for private data generation by providing methods to estimate quantiles with differential privacy, but it is incremental as it builds on existing algorithms and tradeoffs.
The paper tackles the problem of estimating many statistical quantiles under differential privacy, comparing two approaches: private empirical quantile estimation and density estimation. It finds that for estimating many quantiles, density estimation is more effective than pointwise quantile estimation.
This work studies the estimation of many statistical quantiles under differential privacy. More precisely, given a distribution and access to i.i.d. samples from it, we study the estimation of the inverse of its cumulative distribution function (the quantile function) at specific points. For instance, this task is of key importance in private data generation. We present two different approaches. The first one consists in privately estimating the empirical quantiles of the samples and using this result as an estimator of the quantiles of the distribution. In particular, we study the statistical properties of the recently published algorithm introduced by Kaplan et al. 2022 that privately estimates the quantiles recursively. The second approach is to use techniques of density estimation in order to uniformly estimate the quantile function on an interval. In particular, we show that there is a tradeoff between the two methods. When we want to estimate many quantiles, it is better to estimate the density rather than estimating the quantile function at specific points.