DSCRITLGMLNov 27, 2018

The Structure of Optimal Private Tests for Simple Hypotheses

arXiv:1811.11148v289 citations
Originality Highly original
AI Analysis

This work addresses a fundamental problem in statistical inference for settings with privacy constraints, such as data analysis, by providing optimal private tests, though it is incremental as it builds on classical hypothesis testing frameworks.

The paper tackles the problem of determining the optimal sample complexity for privately testing simple hypotheses under differential privacy, characterizing it up to constant factors and showing it is achieved by a clamped log-likelihood ratio test, with an application to private change-point detection.

Hypothesis testing plays a central role in statistical inference, and is used in many settings where privacy concerns are paramount. This work answers a basic question about privately testing simple hypotheses: given two distributions $P$ and $Q$, and a privacy level $\varepsilon$, how many i.i.d. samples are needed to distinguish $P$ from $Q$ subject to $\varepsilon$-differential privacy, and what sort of tests have optimal sample complexity? Specifically, we characterize this sample complexity up to constant factors in terms of the structure of $P$ and $Q$ and the privacy level $\varepsilon$, and show that this sample complexity is achieved by a certain randomized and clamped variant of the log-likelihood ratio test. Our result is an analogue of the classical Neyman-Pearson lemma in the setting of private hypothesis testing. We also give an application of our result to the private change-point detection. Our characterization applies more generally to hypothesis tests satisfying essentially any notion of algorithmic stability, which is known to imply strong generalization bounds in adaptive data analysis, and thus our results have applications even when privacy is not a primary concern.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes