LGCROCMLApr 19, 2021

Provable Robustness of Adversarial Training for Learning Halfspaces with Noise

arXiv:2104.09437v115 citations
Originality Highly original
AI Analysis

This provides provable robustness guarantees for adversarial training in noisy settings, addressing a foundational challenge in robust machine learning.

The paper tackles the problem of learning adversarially robust halfspaces with agnostic label noise, showing that adversarial training on binary cross-entropy loss yields robust classifiers with error bounds like $ ilde O(\sqrt{\mathsf{OPT}_{2,r}})$ for $p=2$, and using a nonconvex sigmoidal loss improves this to $O(\mathsf{OPT}_{2,r})$ for $p=2$.

We analyze the properties of adversarial training for learning adversarially robust halfspaces in the presence of agnostic label noise. Denoting $\mathsf{OPT}_{p,r}$ as the best robust classification error achieved by a halfspace that is robust to perturbations of $\ell_{p}$ balls of radius $r$, we show that adversarial training on the standard binary cross-entropy loss yields adversarially robust halfspaces up to (robust) classification error $\tilde O(\sqrt{\mathsf{OPT}_{2,r}})$ for $p=2$, and $\tilde O(d^{1/4} \sqrt{\mathsf{OPT}_{\infty, r}} + d^{1/2} \mathsf{OPT}_{\infty,r})$ when $p=\infty$. Our results hold for distributions satisfying anti-concentration properties enjoyed by log-concave isotropic distributions among others. We additionally show that if one instead uses a nonconvex sigmoidal loss, adversarial training yields halfspaces with an improved robust classification error of $O(\mathsf{OPT}_{2,r})$ for $p=2$, and $O(d^{1/4}\mathsf{OPT}_{\infty, r})$ when $p=\infty$. To the best of our knowledge, this is the first work to show that adversarial training provably yields robust classifiers in the presence of noise.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes