LGCVAug 23, 2024

Dynamic Label Adversarial Training for Deep Learning Robustness Against Adversarial Attacks

arXiv:2408.13102v13 citationsh-index: 21
Originality Incremental advance
AI Analysis

This work addresses robustness issues in deep learning models for security-critical applications, presenting an incremental improvement over existing adversarial training methods.

The paper tackled the problem of robust overfitting and sub-optimal clean accuracy in adversarial training by proposing a dynamic label adversarial training algorithm, achieving improved robustness against adversarial attacks as validated through extensive experiments.

Adversarial training is one of the most effective methods for enhancing model robustness. Recent approaches incorporate adversarial distillation in adversarial training architectures. However, we notice two scenarios of defense methods that limit their performance: (1) Previous methods primarily use static ground truth for adversarial training, but this often causes robust overfitting; (2) The loss functions are either Mean Squared Error or KL-divergence leading to a sub-optimal performance on clean accuracy. To solve those problems, we propose a dynamic label adversarial training (DYNAT) algorithm that enables the target model to gradually and dynamically gain robustness from the guide model's decisions. Additionally, we found that a budgeted dimension of inner optimization for the target model may contribute to the trade-off between clean accuracy and robust accuracy. Therefore, we propose a novel inner optimization method to be incorporated into the adversarial training. This will enable the target model to adaptively search for adversarial examples based on dynamic labels from the guiding model, contributing to the robustness of the target model. Extensive experiments validate the superior performance of our approach.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes