LG CRMar 14, 2024

Shake to Leak: Fine-tuning Diffusion Models Can Amplify the Generative Privacy Risk

Zhangheng Li, Junyuan Hong, Bo Li, Zhangyang Wang

arXiv:2403.09450v219.322 citationsHas Code2024 IEEE Conference on Secure and Trustworthy Machine Learning (SaTML)

Originality Incremental advance

AI Analysis

This work highlights a severe privacy risk for users of diffusion models, showing that fine-tuning can exacerbate existing vulnerabilities, making it an incremental but critical finding.

The paper reveals that fine-tuning pre-trained diffusion models with manipulated data can amplify privacy risks, increasing membership inference attack AUC by 5.4% and extracted private samples from nearly 0 to 15.8 per domain on average.

While diffusion models have recently demonstrated remarkable progress in generating realistic images, privacy risks also arise: published models or APIs could generate training images and thus leak privacy-sensitive training information. In this paper, we reveal a new risk, Shake-to-Leak (S2L), that fine-tuning the pre-trained models with manipulated data can amplify the existing privacy risks. We demonstrate that S2L could occur in various standard fine-tuning strategies for diffusion models, including concept-injection methods (DreamBooth and Textual Inversion) and parameter-efficient methods (LoRA and Hypernetwork), as well as their combinations. In the worst case, S2L can amplify the state-of-the-art membership inference attack (MIA) on diffusion models by $5.4\%$ (absolute difference) AUC and can increase extracted private samples from almost $0$ samples to $15.8$ samples on average per target domain. This discovery underscores that the privacy risk with diffusion models is even more severe than previously recognized. Codes are available at https://github.com/VITA-Group/Shake-to-Leak.

View on arXiv PDF Code

Similar