LG CV | Aug 1, 2023

Improving Generalization of Adversarial Training via Robust Critical Fine-Tuning

Kaijie Zhu, Jindong Wang, Xixu Hu, Xing Xie, Ge Yang

arXiv:2308.02533v1 | 18.839 citations | h-index: 18 | Has Code

Originality: Incremental advance

AI Analysis

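The work addresses a security risk in critical applications: adversarial training makes models more robust but tends to hurt generalization, and RiFT recovers some of that generalization without sacrificing adversarial robustness. It is incremental in that it builds on existing adversarial training methods.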

The paper tackles the problem of adversarial training reducing generalization ability in deep neural networks by proposing Robustness Critical Fine-Tuning (RiFT), which improves generalization and out-of-distribution robustness by around 1.5% while maintaining adversarial robustness.

Deep neural networks are susceptible to adversarial examples, posing a significant security risk in critical applications. Adversarial Training (AT) is a well-established technique to enhance adversarial robustness, but it often comes at the cost of decreased generalization ability. This paper proposes Robustness Critical Fine-Tuning (RiFT), a novel approach to enhance generalization without compromising adversarial robustness. The core idea of RiFT is to exploit the redundant capacity for robustness by fine-tuning the adversarially trained model on its non-robust-critical module. To do so, we introduce module robust criticality (MRC), a measure that evaluates the significance of a given module to model robustness under worst-case weight perturbations. Using this measure, we identify the module with the lowest MRC value as the non-robust-critical module and fine-tune its weights to obtain fine-tuned weights. Subsequently, we linearly interpolate between the adversarially trained weights and fine-tuned weights to derive the optimal fine-tuned model weights. We demonstrate the efficacy of RiFT on ResNet18, ResNet34, and WideResNet34-10 models trained on CIFAR10, CIFAR100, and Tiny-ImageNet datasets. Our experiments show that RiFT can significantly improve both generalization and out-of-distribution robustness by around 1.5% while maintaining or even slightly enhancing adversarial robustness. Code is available at https://github.com/microsoft/robustlearn.
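
The last two steps of the abstract (pick the module with the lowest MRC value, then linearly interpolate between the adversarially trained and fine-tuned weights) can be sketched in a few lines of plain Python. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names, the toy weights, the toy `toy_evaluate` scorer, and the selection rule in `pick_alpha` (best clean score among interpolation points whose robust score does not drop below that of the adversarially trained model) are all hypothetical. Computing MRC itself and the fine-tuning step are omitted; the MRC scores are taken as given.

```python
# Illustrative sketch only: every name and number here is hypothetical,
# not taken from the authors' repository.

def least_critical_module(mrc_scores):
    """Return the name of the module with the lowest MRC value."""
    return min(mrc_scores, key=mrc_scores.get)


def interpolate(theta_at, theta_ft, alpha):
    """Compute (1 - alpha) * theta_at + alpha * theta_ft, parameter by parameter."""
    return {
        name: [(1.0 - alpha) * a + alpha * f
               for a, f in zip(theta_at[name], theta_ft[name])]
        for name in theta_at
    }


def pick_alpha(theta_at, theta_ft, evaluate, alphas):
    """Sweep alpha and return the one with the best clean score among the
    interpolated weights whose robust score is not below the AT model's.
    `evaluate` is assumed to return a (clean_score, robust_score) pair."""
    base_clean, base_robust = evaluate(theta_at)
    best_alpha, best_clean = 0.0, base_clean
    for alpha in alphas:
        clean, robust = evaluate(interpolate(theta_at, theta_ft, alpha))
        if robust >= base_robust and clean > best_clean:
            best_alpha, best_clean = alpha, clean
    return best_alpha


# Toy data: two "modules", each a flat list of weights (assumed values).
mrc_scores = {"layer1.conv1": 0.42, "layer2.conv2": 0.05}
module = least_critical_module(mrc_scores)

theta_at = {"layer1.conv1": [1.0, 2.0], "layer2.conv2": [0.0, 4.0]}
# Only the selected module differs after the (omitted) fine-tuning step.
theta_ft = {"layer1.conv1": [1.0, 2.0], "layer2.conv2": [2.0, 0.0]}


def toy_evaluate(theta):
    """Stand-in for real evaluation: the clean score grows with one weight,
    and the robust score collapses once that weight moves too far."""
    x = theta["layer2.conv2"][0]
    return x, (1.0 if x <= 1.0 else 0.0)


alpha = pick_alpha(theta_at, theta_ft, toy_evaluate, [0.0, 0.25, 0.5, 0.75, 1.0])
theta_final = interpolate(theta_at, theta_ft, alpha)
```

In a real setting the weights would be model tensors and `evaluate` would measure test accuracy and adversarial accuracy. The sweep is the point of the sketch: interpolation lets the final weights sit anywhere between the two endpoints, so the trade-off can be chosen after fine-tuning.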
