LGMLApr 10, 2018

Adversarial Training Versus Weight Decay

arXiv:1804.03308v324 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of enhancing robustness in performance-critical ML models, offering a comparative analysis that is incremental in nature.

The paper compares adversarial training and weight decay for improving model robustness to input perturbations, finding that weight decay is more stable and reduces generalization errors across a broader range of regimes, with the combination yielding a small model robust to multiple white-box attacks.

Performance-critical machine learning models should be robust to input perturbations not seen during training. Adversarial training is a method for improving a model's robustness to some perturbations by including them in the training process, but this tends to exacerbate other vulnerabilities of the model. The adversarial training framework has the effect of translating the data with respect to the cost function, while weight decay has a scaling effect. Although weight decay could be considered a crude regularization technique, it appears superior to adversarial training as it remains stable over a broader range of regimes and reduces all generalization errors. Equipped with these abstractions, we provide key baseline results and methodology for characterizing robustness. The two approaches can be combined to yield one small model that demonstrates good robustness to several white-box attacks associated with different metrics.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes