LGCRMay 3, 2022

Privacy Amplification via Random Participation in Federated Learning

arXiv:2205.01556v12 citationsh-index: 11
Originality Incremental advance
AI Analysis

This work addresses privacy concerns for users in federated learning systems, offering incremental improvements over existing methods.

The paper tackles the problem of privacy amplification in federated learning by analyzing random client participation alongside local dataset subsampling, showing that it improves privacy guarantees compared to local subsampling alone, with results indicating close-to-centralized privacy for small datasets and better performance for large ones.

Running a randomized algorithm on a subsampled dataset instead of the entire dataset amplifies differential privacy guarantees. In this work, in a federated setting, we consider random participation of the clients in addition to subsampling their local datasets. Since such random participation of the clients creates correlation among the samples of the same client in their subsampling, we analyze the corresponding privacy amplification via non-uniform subsampling. We show that when the size of the local datasets is small, the privacy guarantees via random participation is close to those of the centralized setting, in which the entire dataset is located in a single host and subsampled. On the other hand, when the local datasets are large, observing the output of the algorithm may disclose the identities of the sampled clients with high confidence. Our analysis reveals that, even in this case, privacy guarantees via random participation outperform those via only local subsampling.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes