CRMar 8, 2021

Efficient Error Prediction for Differentially Private Algorithms

arXiv:2103.04816v23.8

Originality Synthesis-oriented

AI Analysis

This work addresses the accuracy/privacy trade-off in differential privacy for data collection applications, but it is incremental as it builds on existing factor experiment methods.

The paper tackles the challenge of predicting accuracy loss in differentially private algorithms by proposing a data-aware error prediction method using factor experiments, demonstrated through a case study on tree-structured data with a least squares model.

Differential privacy is a strong mathematical notion of privacy. Still, a prominent challenge when using differential privacy in real data collection is understanding and counteracting the accuracy loss that differential privacy imposes. As such, the accuracy/privacy trade-off of differential privacy needs to be balanced on a case-by-case basis. Applications in the literature tend to focus solely on analytical accuracy bounds, not include data in error prediction, or use arbitrary settings to measure error empirically. To fill the gap in the literature, we propose a novel application of factor experiments to create data aware error predictions. Basically, factor experiments provide a systematic approach to conducting empirical experiments. To demonstrate our methodology in action, we conduct a case study where error is dependent on arbitrarily complex tree structures. We first construct a tool to simulate poll data. Next, we use our simulated data to construct a least squares model to predict error. Last, we show how to validate the model. Consequently, our contribution is a method for constructing error prediction models that are data aware.

View on arXiv PDF

Similar