LGCVJun 26, 2021

Multi-stage Optimization based Adversarial Training

arXiv:2106.15357v15 citations
Originality Incremental advance
AI Analysis

This addresses a specific problem in adversarial robustness for machine learning practitioners, offering an incremental improvement to training efficiency and stability.

The paper tackles catastrophic overfitting in single-step adversarial training by introducing multi-step adversarial examples and a multi-stage optimization method (MOAT) to reduce training overhead. Experiments on CIFAR-10 and CIFAR-100 show MOAT achieves better robustness than existing methods under similar training costs.

In the field of adversarial robustness, there is a common practice that adopts the single-step adversarial training for quickly developing adversarially robust models. However, the single-step adversarial training is most likely to cause catastrophic overfitting, as after a few training epochs it will be hard to generate strong adversarial examples to continuously boost the adversarial robustness. In this work, we aim to avoid the catastrophic overfitting by introducing multi-step adversarial examples during the single-step adversarial training. Then, to balance the large training overhead of generating multi-step adversarial examples, we propose a Multi-stage Optimization based Adversarial Training (MOAT) method that periodically trains the model on mixed benign examples, single-step adversarial examples, and multi-step adversarial examples stage by stage. In this way, the overall training overhead is reduced significantly, meanwhile, the model could avoid catastrophic overfitting. Extensive experiments on CIFAR-10 and CIFAR-100 datasets demonstrate that under similar amount of training overhead, the proposed MOAT exhibits better robustness than either single-step or multi-step adversarial training methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes