AICLLGOct 10, 2023

Exploring Memorization in Fine-tuned Language Models

arXiv:2310.06714v254 citationsh-index: 18Has Code
Originality Incremental advance
AI Analysis

This addresses privacy and copyright concerns for users of fine-tuned models, though it is incremental as it extends prior pre-training studies to fine-tuning.

The paper investigates memorization in language models during fine-tuning, revealing strong disparities across tasks and linking memorization to attention score distribution.

Large language models (LLMs) have shown great capabilities in various tasks but also exhibited memorization of training data, raising tremendous privacy and copyright concerns. While prior works have studied memorization during pre-training, the exploration of memorization during fine-tuning is rather limited. Compared to pre-training, fine-tuning typically involves more sensitive data and diverse objectives, thus may bring distinct privacy risks and unique memorization behaviors. In this work, we conduct the first comprehensive analysis to explore language models' (LMs) memorization during fine-tuning across tasks. Our studies with open-sourced and our own fine-tuned LMs across various tasks indicate that memorization presents a strong disparity among different fine-tuning tasks. We provide an intuitive explanation of this task disparity via sparse coding theory and unveil a strong correlation between memorization and attention score distribution.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes