CL LGMay 26, 2022

Differentially Private Decoding in Large Language Models

Jimit Majmudar, Christophe Dupuy, Charith Peris, Sami Smaili, Rahul Gupta, Richard Zemel

Amazon

arXiv:2205.13621v27.045 citationsh-index: 78

Originality Incremental advance

AI Analysis

This addresses privacy risks for users of LLMs in NLP applications, though it is incremental as it builds on existing differential privacy methods.

The paper tackles the problem of privacy leakage in large language models (LLMs) due to memorization of training data, proposing a differentially private decoding mechanism that is model-agnostic and computationally lightweight, with experimental results demonstrating a privacy-utility trade-off.

Recent large-scale natural language processing (NLP) systems use a pre-trained Large Language Model (LLM) on massive and diverse corpora as a headstart. In practice, the pre-trained model is adapted to a wide array of tasks via fine-tuning on task-specific datasets. LLMs, while effective, have been shown to memorize instances of training data thereby potentially revealing private information processed during pre-training. The potential leakage might further propagate to the downstream tasks for which LLMs are fine-tuned. On the other hand, privacy-preserving algorithms usually involve retraining from scratch, which is prohibitively expensive for LLMs. In this work, we propose a simple, easy to interpret, and computationally lightweight perturbation mechanism to be applied to an already trained model at the decoding stage. Our perturbation mechanism is model-agnostic and can be used in conjunction with any LLM. We provide theoretical analysis showing that the proposed mechanism is differentially private, and experimental results showing a privacy-utility trade-off.

View on arXiv PDF

Similar