LGCRJun 22, 2022

Guided Diffusion Model for Adversarial Purification from Random Noise

arXiv:2206.10875v148 citationsh-index: 7
Originality Incremental advance
AI Analysis

This provides a strong defense against adversarial attacks for machine learning security, with incremental improvements over existing methods.

The paper tackles adversarial attacks by proposing a guided diffusion purification approach, achieving 89.62% robust accuracy under PGD-L_inf attack on CIFAR-10 and outperforming randomized smoothing by 5% in certified robustness for larger radii.

In this paper, we propose a novel guided diffusion purification approach to provide a strong defense against adversarial attacks. Our model achieves 89.62% robust accuracy under PGD-L_inf attack (eps = 8/255) on the CIFAR-10 dataset. We first explore the essential correlations between unguided diffusion models and randomized smoothing, enabling us to apply the models to certified robustness. The empirical results show that our models outperform randomized smoothing by 5% when the certified L2 radius r is larger than 0.5.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes