LGAICLJan 4, 2022

Submix: Practical Private Prediction for Large-Scale Language Models

arXiv:2201.00971v130 citations
AI Analysis

It addresses privacy risks for users of large language models fine-tuned on private data, but it is incremental as it builds on differential privacy concepts.

The paper tackles the problem of language models memorizing training data, which compromises privacy, by introducing SubMix, a practical protocol for private next-token prediction that prevents privacy violations while maintaining utility, and it is the first to maintain privacy when publicly releasing tens of thousands of predictions for models like GPT-2.

Recent data-extraction attacks have exposed that language models can memorize some training samples verbatim. This is a vulnerability that can compromise the privacy of the model's training data. In this work, we introduce SubMix: a practical protocol for private next-token prediction designed to prevent privacy violations by language models that were fine-tuned on a private corpus after pre-training on a public corpus. We show that SubMix limits the leakage of information that is unique to any individual user in the private corpus via a relaxation of group differentially private prediction. Importantly, SubMix admits a tight, data-dependent privacy accounting mechanism, which allows it to thwart existing data-extraction attacks while maintaining the utility of the language model. SubMix is the first protocol that maintains privacy even when publicly releasing tens of thousands of next-token predictions made by large transformer-based models such as GPT-2.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes