LGCROct 28, 2023

Purify++: Improving Diffusion-Purification with Advanced Diffusion Models and Control of Randomness

arXiv:2310.18762v117 citationsh-index: 7
Originality Incremental advance
AI Analysis

This work addresses AI safety by enhancing adversarial purification, but it is incremental as it builds on existing diffusion-based approaches.

The paper tackled the problem of defending neural network classifiers against adversarial attacks by improving diffusion purification methods, resulting in Purify++, which achieves state-of-the-art performance against several attacks.

Adversarial attacks can mislead neural network classifiers. The defense against adversarial attacks is important for AI safety. Adversarial purification is a family of approaches that defend adversarial attacks with suitable pre-processing. Diffusion models have been shown to be effective for adversarial purification. Despite their success, many aspects of diffusion purification still remain unexplored. In this paper, we investigate and improve upon three limiting designs of diffusion purification: the use of an improved diffusion model, advanced numerical simulation techniques, and optimal control of randomness. Based on our findings, we propose Purify++, a new diffusion purification algorithm that is now the state-of-the-art purification method against several adversarial attacks. Our work presents a systematic exploration of the limits of diffusion purification methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes