MLAILGFeb 5, 2021

Adversarial Training Makes Weight Loss Landscape Sharper in Logistic Regression

arXiv:2102.02950v110 citations
Originality Incremental advance
AI Analysis

This work provides a theoretical explanation for the observed sharpening of the weight loss landscape during adversarial training, which is a problem for researchers and practitioners aiming to improve the generalization of robust models.

This paper theoretically proves that adversarial training with L2 norm constraints sharpens the weight loss landscape in linear logistic regression models. The sharpness is attributed to noise aligned with the direction of increasing loss, and both theoretical and experimental results confirm that increasing noise magnitude leads to a sharper landscape in linear logistic regression and ResNet18.

Adversarial training is actively studied for learning robust models against adversarial examples. A recent study finds that adversarially trained models degenerate generalization performance on adversarial examples when their weight loss landscape, which is loss changes with respect to weights, is sharp. Unfortunately, it has been experimentally shown that adversarial training sharpens the weight loss landscape, but this phenomenon has not been theoretically clarified. Therefore, we theoretically analyze this phenomenon in this paper. As a first step, this paper proves that adversarial training with the L2 norm constraints sharpens the weight loss landscape in the linear logistic regression model. Our analysis reveals that the sharpness of the weight loss landscape is caused by the noise aligned in the direction of increasing the loss, which is used in adversarial training. We theoretically and experimentally confirm that the weight loss landscape becomes sharper as the magnitude of the noise of adversarial training increases in the linear logistic regression model. Moreover, we experimentally confirm the same phenomena in ResNet18 with softmax as a more general case.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes