LGFeb 15, 2021

Guided Interpolation for Adversarial Training

arXiv:2102.07327v110 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of improving adversarial robustness in deep neural networks, which is an incremental advancement in adversarial training techniques.

The paper tackles the problem of adversarial training becoming less effective as training data becomes less attackable, proposing a guided interpolation framework (GIF) that uses meta information to guide data interpolation, resulting in enhanced adversarial robustness across various methods and datasets.

To enhance adversarial robustness, adversarial training learns deep neural networks on the adversarial variants generated by their natural data. However, as the training progresses, the training data becomes less and less attackable, undermining the robustness enhancement. A straightforward remedy is to incorporate more training data, but sometimes incurring an unaffordable cost. In this paper, to mitigate this issue, we propose the guided interpolation framework (GIF): in each epoch, the GIF employs the previous epoch's meta information to guide the data's interpolation. Compared with the vanilla mixup, the GIF can provide a higher ratio of attackable data, which is beneficial to the robustness enhancement; it meanwhile mitigates the model's linear behavior between classes, where the linear behavior is favorable to generalization but not to the robustness. As a result, the GIF encourages the model to predict invariantly in the cluster of each class. Experiments demonstrate that the GIF can indeed enhance adversarial robustness on various adversarial training methods and various datasets.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes