CRPLSep 15, 2021

DPGen: Automated Program Synthesis for Differential Privacy

arXiv:2109.07441v114 citations
Originality Highly original
AI Analysis

This addresses the challenge for data scientists and privacy engineers in creating efficient and correct differentially private algorithms, representing a novel method for automating a known bottleneck.

The authors tackled the problem of manually designing differentially private algorithms by introducing DPGen, an automated program synthesizer that converts non-private code into differentially private versions with optimized noise calibration, achieving synthesis within 120 seconds on standard benchmarks.

Differential privacy has become a de facto standard for releasing data in a privacy-preserving way. Creating a differentially private algorithm is a process that often starts with a noise-free (non-private) algorithm. The designer then decides where to add noise, and how much of it to add. This can be a non-trivial process -- if not done carefully, the algorithm might either violate differential privacy or have low utility. In this paper, we present DPGen, a program synthesizer that takes in non-private code (without any noise) and automatically synthesizes its differentially private version (with carefully calibrated noise). Under the hood, DPGen uses novel algorithms to automatically generate a sketch program with candidate locations for noise, and then optimize privacy proof and noise scales simultaneously on the sketch program. Moreover, DPGen can synthesize sophisticated mechanisms that adaptively process queries until a specified privacy budget is exhausted. When evaluated on standard benchmarks, DPGen is able to generate differentially private mechanisms that optimize simple utility functions within 120 seconds. It is also powerful enough to synthesize adaptive privacy mechanisms.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes