CVLGMay 27, 2025

What is Adversarial Training for Diffusion Models?

arXiv:2505.21742v1h-index: 24
Originality Incremental advance
AI Analysis

This addresses robustness issues in diffusion models for applications like image generation, though it is incremental as it adapts adversarial training from classifiers.

The paper tackles the problem of adversarial training for diffusion models, showing it requires equivariance to align with data distribution, and demonstrates improved robustness to noise, corruption, and attacks on benchmarks like CIFAR-10 and CelebA.

We answer the question in the title, showing that adversarial training (AT) for diffusion models (DMs) fundamentally differs from classifiers: while AT in classifiers enforces output invariance, AT in DMs requires equivariance to keep the diffusion process aligned with the data distribution. AT is a way to enforce smoothness in the diffusion flow, improving robustness to outliers and corrupted data. Unlike prior art, our method makes no assumptions about the noise model and integrates seamlessly into diffusion training by adding random noise, similar to randomized smoothing, or adversarial noise, akin to AT. This enables intrinsic capabilities such as handling noisy data, dealing with extreme variability such as outliers, preventing memorization, and improving robustness. We rigorously evaluate our approach with proof-of-concept datasets with known distributions in low- and high-dimensional space, thereby taking a perfect measure of errors; we further evaluate on standard benchmarks such as CIFAR-10, CelebA and LSUN Bedroom, showing strong performance under severe noise, data corruption, and iterative adversarial attacks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes