CV CR LGMay 26, 2025

Structure Disruption: Subverting Malicious Diffusion-Based Inpainting via Self-Attention Query Perturbation

Yuhao He, Jinyu Tian, Haiwei Wu, Jianqing Li

arXiv:2505.19425v13.6h-index: 14

Originality Highly original

AI Analysis

This addresses the societal risk of adversaries exploiting user images from social media to create misleading content, offering a targeted defense against inpainting-based editing.

The paper tackles the problem of malicious image inpainting using diffusion models by proposing the Structure Disruption Attack (SDA), which disrupts self-attention queries to prevent coherent image generation, achieving state-of-the-art protection performance in experiments.

The rapid advancement of diffusion models has enhanced their image inpainting and editing capabilities but also introduced significant societal risks. Adversaries can exploit user images from social media to generate misleading or harmful content. While adversarial perturbations can disrupt inpainting, global perturbation-based methods fail in mask-guided editing tasks due to spatial constraints. To address these challenges, we propose Structure Disruption Attack (SDA), a powerful protection framework for safeguarding sensitive image regions against inpainting-based editing. Building upon the contour-focused nature of self-attention mechanisms of diffusion models, SDA optimizes perturbations by disrupting queries in self-attention during the initial denoising step to destroy the contour generation process. This targeted interference directly disrupts the structural generation capability of diffusion models, effectively preventing them from producing coherent images. We validate our motivation through visualization techniques and extensive experiments on public datasets, demonstrating that SDA achieves state-of-the-art (SOTA) protection performance while maintaining strong robustness.

View on arXiv PDF

Similar