LGAICVOct 15, 2020

Overfitting or Underfitting? Understand Robustness Drop in Adversarial Training

arXiv:2010.08034v18 citations
AI Analysis

This addresses a key robustness issue in adversarial training for machine learning security, offering a more efficient solution.

The paper investigates why robustness decreases with prolonged adversarial training, attributing it to perturbation underfitting rather than overfitting, and introduces APART, an adaptive framework that achieves comparable or better robustness than PGD-10 at about 1/4 of the computational cost.

Our goal is to understand why the robustness drops after conducting adversarial training for too long. Although this phenomenon is commonly explained as overfitting, our analysis suggest that its primary cause is perturbation underfitting. We observe that after training for too long, FGSM-generated perturbations deteriorate into random noise. Intuitively, since no parameter updates are made to strengthen the perturbation generator, once this process collapses, it could be trapped in such local optima. Also, sophisticating this process could mostly avoid the robustness drop, which supports that this phenomenon is caused by underfitting instead of overfitting. In the light of our analyses, we propose APART, an adaptive adversarial training framework, which parameterizes perturbation generation and progressively strengthens them. Shielding perturbations from underfitting unleashes the potential of our framework. In our experiments, APART provides comparable or even better robustness than PGD-10, with only about 1/4 of its computational cost.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes