CRLGJan 5, 2018

Differentially Private Releasing via Deep Generative Model (Technical Report)

arXiv:1801.01594v274 citations
Originality Incremental advance
AI Analysis

It addresses the problem of balancing privacy and data utility for data curators and analysts, offering a novel approach but with incremental improvements over existing generative methods.

The paper tackles the challenge of privacy-preserving data release for complex data like images and text by introducing dp-GAN, a framework that trains a deep generative model with differential privacy to produce synthetic data, achieving practical scalability and utility in analyses.

Privacy-preserving releasing of complex data (e.g., image, text, audio) represents a long-standing challenge for the data mining research community. Due to rich semantics of the data and lack of a priori knowledge about the analysis task, excessive sanitization is often necessary to ensure privacy, leading to significant loss of the data utility. In this paper, we present dp-GAN, a general private releasing framework for semantic-rich data. Instead of sanitizing and then releasing the data, the data curator publishes a deep generative model which is trained using the original data in a differentially private manner; with the generative model, the analyst is able to produce an unlimited amount of synthetic data for arbitrary analysis tasks. In contrast of alternative solutions, dp-GAN highlights a set of key features: (i) it provides theoretical privacy guarantee via enforcing the differential privacy principle; (ii) it retains desirable utility in the released model, enabling a variety of otherwise impossible analyses; and (iii) most importantly, it achieves practical training scalability and stability by employing multi-fold optimization strategies. Through extensive empirical evaluation on benchmark datasets and analyses, we validate the efficacy of dp-GAN.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes