CL AIJun 30, 2025

Impact of Fine-Tuning Methods on Memorization in Large Language Models

Jie Hou, Chuxiong Wu, Lannan Luo, Qiang Zeng

arXiv:2507.00258v12 citationsh-index: 18

Originality Incremental advance

AI Analysis

This addresses privacy concerns for users and developers of fine-tuned LLMs, though it is incremental as it builds on existing fine-tuning paradigms.

The study tackled the problem of privacy risks from memorization in large language models during fine-tuning by comparing parameter-based and prompt-based methods, finding that prompt-based fine-tuning achieves competitive performance with lower vulnerability to membership inference attacks and maintains low memorization across model scales.

As the capabilities of pre-trained large language models (LLMs) continue to advance, the "pre-train and fine-tune" paradigm has become increasingly mainstream, leading to the development of various fine-tuning methods. However, the privacy risks arising from memorization during fine-tuning have received relatively little attention. To address this gap, we categorize popular fine-tuning approaches and assess their impact on memorization through the lens of membership inference attacks (MIAs). Our results show that, compared to parameter-based fine-tuning, prompt-based fine-tuning achieves competitive performance while exhibiting lower vulnerability to MIAs. Furthermore, prompt-based methods maintain low memorization regardless of model scale. These findings suggest that parameter-based fine-tuning is more prone to leaking private information, whereas prompt-based fine-tuning serves as a more privacy-preserving option.

View on arXiv PDF

Similar