STCRLGMEMLOct 29, 2023

Differentially Private Permutation Tests: Applications to Kernel Methods

arXiv:2310.19043v213 citationsh-index: 8
Originality Incremental advance
AI Analysis

This work addresses privacy concerns for sensitive data analysis in statistical testing, offering a practical and efficient solution that is incremental by extending classical non-private methods to private settings.

The paper tackles the problem of privacy in hypothesis testing by introducing differentially private permutation tests, achieving minimax optimal power across privacy regimes with competitive empirical performance on synthetic and real-world data.

Recent years have witnessed growing concerns about the privacy of sensitive data. In response to these concerns, differential privacy has emerged as a rigorous framework for privacy protection, gaining widespread recognition in both academic and industrial circles. While substantial progress has been made in private data analysis, existing methods often suffer from impracticality or a significant loss of statistical efficiency. This paper aims to alleviate these concerns in the context of hypothesis testing by introducing differentially private permutation tests. The proposed framework extends classical non-private permutation tests to private settings, maintaining both finite-sample validity and differential privacy in a rigorous manner. The power of the proposed test depends on the choice of a test statistic, and we establish general conditions for consistency and non-asymptotic uniform power. To demonstrate the utility and practicality of our framework, we focus on reproducing kernel-based test statistics and introduce differentially private kernel tests for two-sample and independence testing: dpMMD and dpHSIC. The proposed kernel tests are straightforward to implement, applicable to various types of data, and attain minimax optimal power across different privacy regimes. Our empirical evaluations further highlight their competitive power under various synthetic and real-world scenarios, emphasizing their practical value. The code is publicly available to facilitate the implementation of our framework.

Code Implementations4 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes