LGCRCVJan 17, 2023

Denoising Diffusion Probabilistic Models as a Defense against Adversarial Attacks

arXiv:2301.06871v18 citationsh-index: 3Has Code
Originality Synthesis-oriented
AI Analysis

This addresses the security issue of adversarial attacks for neural network users, but it is incremental as it applies an existing method to a new defense context.

The paper tackles the problem of neural networks' vulnerability to adversarial attacks by using Denoising Diffusion Probabilistic Models (DDPM) as a purification technique, achieving an improvement in robust accuracy by up to 88% of the original model's accuracy on the PatchCamelyon dataset.

Neural Networks are infamously sensitive to small perturbations in their inputs, making them vulnerable to adversarial attacks. This project evaluates the performance of Denoising Diffusion Probabilistic Models (DDPM) as a purification technique to defend against adversarial attacks. This works by adding noise to an adversarial example before removing it through the reverse process of the diffusion model. We evaluate the approach on the PatchCamelyon data set for histopathologic scans of lymph node sections and find an improvement of the robust accuracy by up to 88\% of the original model's accuracy, constituting a considerable improvement over the vanilla model and our baselines. The project code is located at https://github.com/ankile/Adversarial-Diffusion.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes