CRDBDSNov 16, 2021

Improved Pan-Private Stream Density Estimation

arXiv:2111.08784v1
Originality Incremental advance
AI Analysis

This work addresses privacy-preserving data analysis for streaming applications, offering incremental improvements in efficiency for scenarios requiring robust privacy guarantees.

The authors tackled the problem of estimating user density in streaming data under strong user-level pan-privacy, developing new algorithms that outperform conventional methods both theoretically and experimentally.

Differential privacy is a rigorous definition for privacy that guarantees that any analysis performed on a sensitive dataset leaks no information about the individuals whose data are contained therein. In this work, we develop new differentially private algorithms to analyze streaming data. Specifically, we consider the problem of estimating the density of a stream of users (or, more generally, elements), which expresses the fraction of all users that actually appear in the stream. We focus on one of the strongest privacy guarantees for the streaming model, namely user-level pan-privacy, which ensures that the privacy of any user is protected, even against an adversary that observes, on rare occasions, the internal state of the algorithm. Our proposed algorithms employ optimally all the allocated privacy budget, are specially tailored for the streaming model, and, hence, outperform both theoretically and experimentally the conventional sampling-based approach.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes