LG AI CAJan 12, 2023

Phase-shifted Adversarial Training

Yeachan Kim, Seongyeon Kim, Ihyeok Seo, Bonggun Shin

arXiv:2301.04785v33.81 citationsh-index: 12

Originality Incremental advance

AI Analysis

This work addresses the robustness of neural networks for real-world deployment by improving adversarial training efficiency, though it is incremental as it builds on existing frequency analysis methods.

The paper tackled the problem of adversarial training causing neural networks to have low convergence to high-frequency information, leading to oscillated predictions, and proposed Phase-shifted Adversarial Training (PhaseAT) to improve this, resulting in significantly enhanced adversarial robustness on CIFAR-10 and ImageNet.

Adversarial training has been considered an imperative component for safely deploying neural network-based applications to the real world. To achieve stronger robustness, existing methods primarily focus on how to generate strong attacks by increasing the number of update steps, regularizing the models with the smoothed loss function, and injecting the randomness into the attack. Instead, we analyze the behavior of adversarial training through the lens of response frequency. We empirically discover that adversarial training causes neural networks to have low convergence to high-frequency information, resulting in highly oscillated predictions near each data. To learn high-frequency contents efficiently and effectively, we first prove that a universal phenomenon of frequency principle, i.e., \textit{lower frequencies are learned first}, still holds in adversarial training. Based on that, we propose phase-shifted adversarial training (PhaseAT) in which the model learns high-frequency components by shifting these frequencies to the low-frequency range where the fast convergence occurs. For evaluations, we conduct the experiments on CIFAR-10 and ImageNet with the adaptive attack carefully designed for reliable evaluation. Comprehensive results show that PhaseAT significantly improves the convergence for high-frequency information. This results in improved adversarial robustness by enabling the model to have smoothed predictions near each data.

View on arXiv PDF

Similar