LGCRMLJun 15, 2020

GS-WGAN: A Gradient-Sanitized Approach for Learning Differentially Private Generators

arXiv:2006.08265v2216 citations
AI Analysis

This addresses privacy concerns in domains with sensitive data, enabling data sharing for machine learning applications, though it appears incremental as it builds on prior work with improvements in gradient distortion and model depth.

The paper tackles the problem of generating synthetic data with privacy guarantees for sensitive domains like medical data, proposing GS-WGAN, which distorts gradients more precisely to train deeper models and generate higher-quality samples, outperforming state-of-the-art methods across multiple metrics and datasets.

The wide-spread availability of rich data has fueled the growth of machine learning applications in numerous domains. However, growth in domains with highly-sensitive data (e.g., medical) is largely hindered as the private nature of data prohibits it from being shared. To this end, we propose Gradient-sanitized Wasserstein Generative Adversarial Networks (GS-WGAN), which allows releasing a sanitized form of the sensitive data with rigorous privacy guarantees. In contrast to prior work, our approach is able to distort gradient information more precisely, and thereby enabling training deeper models which generate more informative samples. Moreover, our formulation naturally allows for training GANs in both centralized and federated (i.e., decentralized) data scenarios. Through extensive experiments, we find our approach consistently outperforms state-of-the-art approaches across multiple metrics (e.g., sample quality) and datasets.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes