CR DSJun 5, 2020

Differentially private partition selection

Damien Desfontaines, James Voss, Bryant Gipson, Chinmoy Mandayam

arXiv:2006.03684v413.624 citations

Originality Incremental advance

AI Analysis

This work addresses privacy concerns in data analysis for scenarios involving unbounded partitions, but it is incremental as it builds on existing formalizations like differentially private set union.

The paper tackles the problem of ensuring differential privacy for GROUP BY queries by making the set of partitions released differentially private, proposing a simple, optimal mechanism that maximizes the number of released partitions in the setting where each user is associated with a single partition.

Many data analysis operations can be expressed as a GROUP BY query on an unbounded set of partitions, followed by a per-partition aggregation. To make such a query differentially private, adding noise to each aggregation is not enough: we also need to make sure that the set of partitions released is also differentially private. This problem is not new, and it was recently formally introduced as differentially private set union. In this work, we continue this area of study, and focus on the common setting where each user is associated with a single partition. In this setting, we propose a simple, optimal differentially private mechanism that maximizes the number of released partitions. We discuss implementation considerations, as well as the possible extension of this approach to the setting where each user contributes to a fixed, small number of partitions.

View on arXiv PDF

Similar