LGCVApr 16, 2025

SemDiff: Generating Natural Unrestricted Adversarial Examples via Semantic Attributes Optimization in Diffusion Models

arXiv:2504.11923v1h-index: 13
Originality Highly original
AI Analysis

This work addresses a critical safety issue in deep learning models by improving the naturalness and imperceptibility of adversarial attacks, though it is incremental as it builds on existing diffusion model approaches.

The paper tackles the problem of generating unrestricted adversarial examples (UAEs) that lack naturalness and imperceptibility by proposing SemDiff, which optimizes semantic attributes in diffusion models to create more natural and imperceptible UAEs, achieving higher attack success rates and better imperceptibility compared to state-of-the-art methods.

Unrestricted adversarial examples (UAEs), allow the attacker to create non-constrained adversarial examples without given clean samples, posing a severe threat to the safety of deep learning models. Recent works utilize diffusion models to generate UAEs. However, these UAEs often lack naturalness and imperceptibility due to simply optimizing in intermediate latent noises. In light of this, we propose SemDiff, a novel unrestricted adversarial attack that explores the semantic latent space of diffusion models for meaningful attributes, and devises a multi-attributes optimization approach to ensure attack success while maintaining the naturalness and imperceptibility of generated UAEs. We perform extensive experiments on four tasks on three high-resolution datasets, including CelebA-HQ, AFHQ and ImageNet. The results demonstrate that SemDiff outperforms state-of-the-art methods in terms of attack success rate and imperceptibility. The generated UAEs are natural and exhibit semantically meaningful changes, in accord with the attributes' weights. In addition, SemDiff is found capable of evading different defenses, which further validates its effectiveness and threatening.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes