CRLGMLSep 5, 2018

A Differentially Private Wilcoxon Signed-Rank Test

arXiv:1809.01635v11 citations
Originality Highly original
AI Analysis

This work addresses the need for privacy-preserving statistical tests in data mining and scientific research, offering a more efficient solution for analyzing paired data under differential privacy.

The authors tackled the problem of performing the Wilcoxon signed-rank hypothesis test under differential privacy, presenting a method that computes both a private test statistic and an accurate p-value. Their test requires less than half as much data as the only existing private alternative to achieve the same statistical power.

Hypothesis tests are a crucial statistical tool for data mining and are the workhorse of scientific research in many fields. Here we present a differentially private analogue of the classic Wilcoxon signed-rank hypothesis test, which is used when comparing sets of paired (e.g., before-and-after) data values. We present not only a private estimate of the test statistic, but a method to accurately compute a p-value and assess statistical significance. We evaluate our test on both simulated and real data. Compared to the only existing private test for this situation, that of Task and Clifton, we find that our test requires less than half as much data to achieve the same statistical power.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes