LGCRJun 14, 2024

Adaptive Randomized Smoothing: Certified Adversarial Robustness for Multi-Step Defences

arXiv:2406.10427v39 citationsHas Code
Originality Highly original
AI Analysis

This addresses the challenge of ensuring certified robustness for adaptive machine learning models against adversarial attacks, with incremental advancements in certification methods.

The paper tackles the problem of certifying adversarial robustness for adaptive models by proposing Adaptive Randomized Smoothing (ARS), which extends randomized smoothing using f-Differential Privacy to handle multi-step defenses, and shows improvements in test accuracy by 1 to 15% points on benchmarks and up to 1.6% points on ImageNet.

We propose Adaptive Randomized Smoothing (ARS) to certify the predictions of our test-time adaptive models against adversarial examples. ARS extends the analysis of randomized smoothing using $f$-Differential Privacy to certify the adaptive composition of multiple steps. For the first time, our theory covers the sound adaptive composition of general and high-dimensional functions of noisy inputs. We instantiate ARS on deep image classification to certify predictions against adversarial examples of bounded $L_{\infty}$ norm. In the $L_{\infty}$ threat model, ARS enables flexible adaptation through high-dimensional input-dependent masking. We design adaptivity benchmarks, based on CIFAR-10 and CelebA, and show that ARS improves standard test accuracy by $1$ to $15\%$ points. On ImageNet, ARS improves certified test accuracy by up to $1.6\%$ points over standard RS without adaptivity. Our code is available at https://github.com/ubc-systopia/adaptive-randomized-smoothing .

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes