LGAICRCVMar 6, 2024

Improving Adversarial Training using Vulnerability-Aware Perturbation Budget

arXiv:2403.04070v11 citationsh-index: 4
Originality Incremental advance
AI Analysis

This work addresses the incremental enhancement of adversarial robustness for deep neural networks by optimizing perturbation budgets during training.

The paper tackled the problem of improving adversarial training by proposing vulnerability-aware perturbation budgets, which assign varying perturbation radii based on individual example vulnerabilities, resulting in genuine robustness improvements against various adversarial attacks.

Adversarial Training (AT) effectively improves the robustness of Deep Neural Networks (DNNs) to adversarial attacks. Generally, AT involves training DNN models with adversarial examples obtained within a pre-defined, fixed perturbation bound. Notably, individual natural examples from which these adversarial examples are crafted exhibit varying degrees of intrinsic vulnerabilities, and as such, crafting adversarial examples with fixed perturbation radius for all instances may not sufficiently unleash the potency of AT. Motivated by this observation, we propose two simple, computationally cheap vulnerability-aware reweighting functions for assigning perturbation bounds to adversarial examples used for AT, named Margin-Weighted Perturbation Budget (MWPB) and Standard-Deviation-Weighted Perturbation Budget (SDWPB). The proposed methods assign perturbation radii to individual adversarial samples based on the vulnerability of their corresponding natural examples. Experimental results show that the proposed methods yield genuine improvements in the robustness of AT algorithms against various adversarial attacks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes