CVDec 13, 2024

Real-time Identity Defenses against Malicious Personalization of Diffusion Models

Hanzhong Guo, Shen Nie, Chao Du, Tianyu Pang, Hao Sun, Chongxuan Li

arXiv:2412.09844v26.53 citationsh-index: 41Has Code

Originality Incremental advance

AI Analysis

This addresses identity theft risks for individuals by enabling real-time defenses, though it is incremental as it builds on existing adversarial perturbation methods.

The paper tackles the problem of malicious identity replication using personalized diffusion models by introducing the Real-time Identity Defender (RID), which generates adversarial perturbations in a single forward pass, achieving defense times as low as 0.12 seconds on a GPU and 1.1 seconds on a CPU while providing effective protection.

Personalized generative diffusion models, capable of synthesizing highly realistic images based on a few reference portraits, may pose substantial social, ethical, and legal risks via identity replication. Existing defense mechanisms rely on computationally intensive adversarial perturbations tailored to individual images, rendering them impractical for real-world deployment. This study introduces the Real-time Identity Defender (RID), a neural network designed to generate adversarial perturbations through a single forward pass, bypassing the need for image-specific optimization. RID achieves unprecedented efficiency, with defense times as low as 0.12 seconds on a single NVIDIA A100 80G GPU (4,400 times faster than leading methods) and 1.1 seconds per image on a standard Intel i9 CPU, making it suitable for edge devices such as smartphones. Despite its efficiency, RID achieves promising protection performance across visual and quantitative benchmarks, effectively mitigating identity replication risks. Our analysis reveals that RID's perturbations mimic the efficacy of traditional defenses while exhibiting properties distinct from natural noise, such as Gaussian perturbations. To enhance robustness, we extend RID into an ensemble framework that integrates multiple pre-trained text-to-image diffusion models, ensuring resilience against black-box attacks and post-processing techniques, including image compression and purification. Our model is envisioned to play a crucial role in safeguarding portrait rights, thereby preventing illegal and unethical uses.

View on arXiv PDF Code

Similar