LG AI CR CVMar 6, 2024

Improving Adversarial Training using Vulnerability-Aware Perturbation Budget

Olukorede Fakorede, Modeste Atsague, Jin Tian

arXiv:2403.04070v14.61 citationsh-index: 4

Originality Incremental advance

AI Analysis

This work addresses the incremental enhancement of adversarial robustness for deep neural networks by optimizing perturbation budgets during training.

The paper tackled the problem of improving adversarial training by proposing vulnerability-aware perturbation budgets, which assign varying perturbation radii based on individual example vulnerabilities, resulting in genuine robustness improvements against various adversarial attacks.

Adversarial Training (AT) effectively improves the robustness of Deep Neural Networks (DNNs) to adversarial attacks. Generally, AT involves training DNN models with adversarial examples obtained within a pre-defined, fixed perturbation bound. Notably, individual natural examples from which these adversarial examples are crafted exhibit varying degrees of intrinsic vulnerabilities, and as such, crafting adversarial examples with fixed perturbation radius for all instances may not sufficiently unleash the potency of AT. Motivated by this observation, we propose two simple, computationally cheap vulnerability-aware reweighting functions for assigning perturbation bounds to adversarial examples used for AT, named Margin-Weighted Perturbation Budget (MWPB) and Standard-Deviation-Weighted Perturbation Budget (SDWPB). The proposed methods assign perturbation radii to individual adversarial samples based on the vulnerability of their corresponding natural examples. Experimental results show that the proposed methods yield genuine improvements in the robustness of AT algorithms against various adversarial attacks.

View on arXiv PDF

Similar