CV LGNov 29, 2023

Topology-preserving Adversarial Training for Alleviating Natural Accuracy Degradation

Xiaoyue Mi, Fan Tang, Yepeng Weng, Danding Wang, Juan Cao, Sheng Tang, Peng Li, Yang Liu

arXiv:2311.17607v23.91 citationsh-index: 17Has Code

Originality Incremental advance

AI Analysis

This addresses a key issue for improving robustness in neural networks without sacrificing natural accuracy, though it is incremental as it builds on existing adversarial training methods.

The paper tackles the problem of natural accuracy degradation in adversarial training by revealing its link to disruption of natural sample topology in representation space, and proposes TRAIN to preserve this topology, achieving up to 8.86% improvement in natural accuracy and 6.33% in robust accuracy on datasets like CIFAR-10.

Despite the effectiveness in improving the robustness of neural networks, adversarial training has suffered from the natural accuracy degradation problem, i.e., accuracy on natural samples has reduced significantly. In this study, we reveal that natural accuracy degradation is highly related to the disruption of the natural sample topology in the representation space by quantitative and qualitative experiments. Based on this observation, we propose Topology-pReserving Adversarial traINing (TRAIN) to alleviate the problem by preserving the topology structure of natural samples from a standard model trained only on natural samples during adversarial training. As an additional regularization, our method can be combined with various popular adversarial training algorithms, taking advantage of both sides. Extensive experiments on CIFAR-10, CIFAR-100, and Tiny ImageNet show that our proposed method achieves consistent and significant improvements over various strong baselines in most cases. Specifically, without additional data, TRAIN achieves up to 8.86% improvement in natural accuracy and 6.33% improvement in robust accuracy.

View on arXiv PDF Code

Similar