CRDSJun 5, 2020

Differentially private partition selection

arXiv:2006.03684v424 citations
Originality Incremental advance
AI Analysis

This work addresses privacy concerns in data analysis for scenarios involving unbounded partitions, but it is incremental as it builds on existing formalizations like differentially private set union.

The paper tackles the problem of ensuring differential privacy for GROUP BY queries by making the set of partitions released differentially private, proposing a simple, optimal mechanism that maximizes the number of released partitions in the setting where each user is associated with a single partition.

Many data analysis operations can be expressed as a GROUP BY query on an unbounded set of partitions, followed by a per-partition aggregation. To make such a query differentially private, adding noise to each aggregation is not enough: we also need to make sure that the set of partitions released is also differentially private. This problem is not new, and it was recently formally introduced as differentially private set union. In this work, we continue this area of study, and focus on the common setting where each user is associated with a single partition. In this setting, we propose a simple, optimal differentially private mechanism that maximizes the number of released partitions. We discuss implementation considerations, as well as the possible extension of this approach to the setting where each user contributes to a fixed, small number of partitions.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes