LGAICLCRDec 10, 2023

Beyond Gradient and Priors in Privacy Attacks: Leveraging Pooler Layer Inputs of Language Models in Federated Learning

arXiv:2312.05720v46 citations
Originality Incremental advance
AI Analysis

This work highlights a critical privacy risk for users of federated learning systems, especially with complex language models, and is incremental in focusing on architectural vulnerabilities rather than gradients or priors.

The paper tackles the problem of privacy leakage in federated learning for language models by proposing a two-stage attack that exploits model architecture vulnerabilities, achieving superior performance across datasets and scenarios.

Language models trained via federated learning (FL) demonstrate impressive capabilities in handling complex tasks while protecting user privacy. Recent studies indicate that leveraging gradient information and prior knowledge can potentially reveal training samples within FL setting. However, these investigations have overlooked the potential privacy risks tied to the intrinsic architecture of the models. This paper presents a two-stage privacy attack strategy that targets the vulnerabilities in the architecture of contemporary language models, significantly enhancing attack performance by initially recovering certain feature directions as additional supervisory signals. Our comparative experiments demonstrate superior attack performance across various datasets and scenarios, highlighting the privacy leakage risk associated with the increasingly complex architectures of language models. We call for the community to recognize and address these potential privacy risks in designing large language models.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes