LGFeb 20, 2018

Local Differential Privacy for Evolving Data

arXiv:1802.07128v397 citations
Originality Highly original
AI Analysis

This addresses a critical privacy issue for large-scale data collection systems that periodically recollect user data, preventing long-term privacy erosion in local differential privacy deployments.

The paper tackles the problem of maintaining up-to-date statistics over time in local differential privacy systems, where existing methods fail to provide meaningful long-term privacy guarantees. It introduces a new technique that ensures privacy degrades only with changes in the underlying distribution, not the number of collection periods, and applies it to tracking changing statistics and frequency estimation.

There are now several large scale deployments of differential privacy used to collect statistical information about users. However, these deployments periodically recollect the data and recompute the statistics using algorithms designed for a single use. As a result, these systems do not provide meaningful privacy guarantees over long time scales. Moreover, existing techniques to mitigate this effect do not apply in the "local model" of differential privacy that these systems use. In this paper, we introduce a new technique for local differential privacy that makes it possible to maintain up-to-date statistics over time, with privacy guarantees that degrade only in the number of changes in the underlying distribution rather than the number of collection periods. We use our technique for tracking a changing statistic in the setting where users are partitioned into an unknown collection of groups, and at every time period each user draws a single bit from a common (but changing) group-specific distribution. We also provide an application to frequency and heavy-hitter estimation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes