LGAICRMLFeb 24, 2021

Adversarial Robustness with Non-uniform Perturbations

arXiv:2102.12002v434 citations
Originality Incremental advance
AI Analysis

This addresses security-critical applications where uniform perturbations are unrealistic, offering a domain-specific improvement.

The paper tackles the problem of adversarial robustness in domains like malware, finance, and social networks by proposing non-uniform perturbations that account for feature dependencies, showing improved robustness to real-world attacks and better certification results.

Robustness of machine learning models is critical for security related applications, where real-world adversaries are uniquely focused on evading neural network based detectors. Prior work mainly focus on crafting adversarial examples (AEs) with small uniform norm-bounded perturbations across features to maintain the requirement of imperceptibility. However, uniform perturbations do not result in realistic AEs in domains such as malware, finance, and social networks. For these types of applications, features typically have some semantically meaningful dependencies. The key idea of our proposed approach is to enable non-uniform perturbations that can adequately represent these feature dependencies during adversarial training. We propose using characteristics of the empirical data distribution, both on correlations between the features and the importance of the features themselves. Using experimental datasets for malware classification, credit risk prediction, and spam detection, we show that our approach is more robust to real-world attacks. Finally, we present robustness certification utilizing non-uniform perturbation bounds, and show that non-uniform bounds achieve better certification.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes