LGJan 31, 2025

Differentially Private In-context Learning via Sampling Few-shot Mixed with Zero-shot Outputs

arXiv:2501.19287v13 citationsh-index: 33
Originality Highly original
AI Analysis

This addresses privacy risks in LLM generation tasks for users handling sensitive data, offering a novel method to enhance privacy-utility tradeoffs.

The paper tackles the problem of privacy leakage in in-context learning for generation tasks by proposing a differentially private decoding framework that mixes one-shot and zero-shot outputs, achieving a privacy guarantee of ε=2 with only a 0.3% decrease in ROUGE-L F1 score on the SAMSum dataset.

In-context learning (ICL) has shown promising improvement in downstream task adaptation of LLMs by augmenting prompts with relevant input-output examples (demonstrations). However, the ICL demonstrations can contain privacy-sensitive information, which can be leaked and/or regurgitated by the LLM output. Differential Privacy (DP), a widely adopted privacy safeguard, has emerged to mitigate this privacy leakage, with recent work demonstrating strong privacy-utility tradeoffs in classification tasks for ICL. However, generation tasks for ICL are challenging due to the high-dimensional output space of open-ended generation. To this end, we propose $\texttt{dps-mozo}$, Differentially Private Sampling by Mixing One-shot with Zero-shot Outputs, a decoding framework that generates DP text by sampling from the product of multiple one-shot outputs mixed with a zero-shot output. This mixing effectively reduces the amount of information that can be leaked by each demonstration. By utilizing the inherent randomness in sampling from the mixed distributions, we can achieve DP without adding noise, thereby improving the privacy-utility tradeoff. Our experimental evaluations show $\texttt{dps-mozo}$ can achieve a strong privacy guarantee, $ε=2$, with minimal utility degradation compared to non-private few-shot learning, $\textbf{0.3}$% ROUGE-L F1 score decrease on the SAMSum dataset with Gemma 2 2B.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes