Õptimal Differentially Private Learning of Thresholds and Quasi-Concave Optimization
This solves a fundamental problem in private machine learning with broad implications for data analysis, though it is incremental in refining bounds.
The paper tackles the problem of learning threshold functions under differential privacy, closing the gap for approximate-DP by providing a nearly tight upper bound of ˜O(log* |X|), which matches a known lower bound and improves over prior work, and also gives matching bounds for private quasi-concave optimization.
The problem of learning threshold functions is a fundamental one in machine learning. Classical learning theory implies sample complexity of $O(ξ^{-1} \log(1/β))$ (for generalization error $ξ$ with confidence $1-β$). The private version of the problem, however, is more challenging and in particular, the sample complexity must depend on the size $|X|$ of the domain. Progress on quantifying this dependence, via lower and upper bounds, was made in a line of works over the past decade. In this paper, we finally close the gap for approximate-DP and provide a nearly tight upper bound of $\tilde{O}(\log^* |X|)$, which matches a lower bound by Alon et al (that applies even with improper learning) and improves over a prior upper bound of $\tilde{O}((\log^* |X|)^{1.5})$ by Kaplan et al. We also provide matching upper and lower bounds of $\tildeΘ(2^{\log^*|X|})$ for the additive error of private quasi-concave optimization (a related and more general problem). Our improvement is achieved via the novel Reorder-Slice-Compute paradigm for private data analysis which we believe will have further applications.