LGSTMLApr 25, 2024

A Notion of Uniqueness for the Adversarial Bayes Classifier

arXiv:2404.16956v32 citationsh-index: 2SIAM J Math Data Sci
Originality Incremental advance
AI Analysis

This work provides theoretical tools for understanding adversarial robustness in machine learning, though it is incremental as it focuses on one-dimensional settings.

The authors tackled the problem of characterizing adversarial Bayes classifiers in binary classification by proposing a new uniqueness notion, which enabled them to compute all such classifiers for one-dimensional data distributions and show that regularity improves with increasing perturbation radius.

We propose a new notion of uniqueness for the adversarial Bayes classifier in the setting of binary classification. Analyzing this concept produces a simple procedure for computing all adversarial Bayes classifiers for a well-motivated family of one dimensional data distributions. This characterization is then leveraged to show that as the perturbation radius increases, certain notions of regularity for the adversarial Bayes classifiers improve. Furthermore, these results provide tools for understanding relationships between the Bayes and adversarial Bayes classifiers in one dimension.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes