LGDCFeb 17, 2022

LAMP: Extracting Text from Gradients with Language Model Priors

arXiv:2202.08827v286 citations
Originality Highly original
AI Analysis

This work addresses privacy vulnerabilities in federated learning for text data, showing that gradients leak more information than previously thought, which is a significant concern for users and developers in secure AI applications.

The paper tackles the problem of reconstructing sensitive text data from gradients in federated learning, proposing LAMP, which successfully recovers original text with 5x more bigrams and 23% longer subsequences on average than prior methods.

Recent work shows that sensitive user data can be reconstructed from gradient updates, breaking the key privacy promise of federated learning. While success was demonstrated primarily on image data, these methods do not directly transfer to other domains such as text. In this work, we propose LAMP, a novel attack tailored to textual data, that successfully reconstructs original text from gradients. Our attack is based on two key insights: (i) modeling prior text probability with an auxiliary language model, guiding the search towards more natural text, and (ii) alternating continuous and discrete optimization, which minimizes reconstruction loss on embeddings, while avoiding local minima by applying discrete text transformations. Our experiments demonstrate that LAMP is significantly more effective than prior work: it reconstructs 5x more bigrams and 23% longer subsequences on average. Moreover, we are the first to recover inputs from batch sizes larger than 1 for textual models. These findings indicate that gradient updates of models operating on textual data leak more information than previously thought.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes