LGNov 21, 2021

Local Linearity and Double Descent in Catastrophic Overfitting

arXiv:2111.10754v1

Originality Incremental advance

AI Analysis

This addresses a specific robustness issue in adversarial training for neural networks, but the findings are incremental, building on prior work.

The paper investigates catastrophic overfitting in adversarial training with FGSM, showing that high local linearity is sufficient but not necessary to prevent it, and identifies a double descent phenomenon during training.

Catastrophic overfitting is a phenomenon observed during Adversarial Training (AT) with the Fast Gradient Sign Method (FGSM) where the test robustness steeply declines over just one epoch in the training stage. Prior work has attributed this loss in robustness to a sharp decrease in $\textit{local linearity}$ of the neural network with respect to the input space, and has demonstrated that introducing a local linearity measure as a regularization term prevents catastrophic overfitting. Using a simple neural network architecture, we experimentally demonstrate that maintaining high local linearity might be $\textit{sufficient}$ to prevent catastrophic overfitting but is not $\textit{necessary.}$ Further, inspired by Parseval networks, we introduce a regularization term to AT with FGSM to make the weight matrices of the network orthogonal and study the connection between orthogonality of the network weights and local linearity. Lastly, we identify the $\textit{double descent}$ phenomenon during the adversarial training process.

View on arXiv PDF

Similar