CVCRJan 12

Universal Adversarial Purification with DDIM Metric Loss for Stable Diffusion

arXiv:2601.07253v1
Originality Incremental advance
AI Analysis

This addresses security vulnerabilities in Stable Diffusion for real-world generative AI applications, though it is incremental as it builds on existing purification methods.

The paper tackles the problem of adversarial noise degrading Stable Diffusion outputs by proposing Universal Diffusion Adversarial Purification (UDAP), which effectively removes noise using DDIM metric loss and dynamic epoch adjustment, achieving robustness against diverse attacks like PID and Anti-DreamBooth.

Stable Diffusion (SD) often produces degraded outputs when the training dataset contains adversarial noise. Adversarial purification offers a promising solution by removing adversarial noise from contaminated data. However, existing purification methods are primarily designed for classification tasks and fail to address SD-specific adversarial strategies, such as attacks targeting the VAE encoder, UNet denoiser, or both. To address the gap in SD security, we propose Universal Diffusion Adversarial Purification (UDAP), a novel framework tailored for defending adversarial attacks targeting SD models. UDAP leverages the distinct reconstruction behaviors of clean and adversarial images during Denoising Diffusion Implicit Models (DDIM) inversion to optimize the purification process. By minimizing the DDIM metric loss, UDAP can effectively remove adversarial noise. Additionally, we introduce a dynamic epoch adjustment strategy that adapts optimization iterations based on reconstruction errors, significantly improving efficiency without sacrificing purification quality. Experiments demonstrate UDAP's robustness against diverse adversarial methods, including PID (VAE-targeted), Anti-DreamBooth (UNet-targeted), MIST (hybrid), and robustness-enhanced variants like Anti-Diffusion (Anti-DF) and MetaCloak. UDAP also generalizes well across SD versions and text prompts, showcasing its practical applicability in real-world scenarios.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes