LG ST MLApr 25, 2024

A Notion of Uniqueness for the Adversarial Bayes Classifier

arXiv:2404.16956v36.42 citationsh-index: 2SIAM J Math Data Sci

Originality Incremental advance

AI Analysis

This work provides theoretical tools for understanding adversarial robustness in machine learning, though it is incremental as it focuses on one-dimensional settings.

The authors tackled the problem of characterizing adversarial Bayes classifiers in binary classification by proposing a new uniqueness notion, which enabled them to compute all such classifiers for one-dimensional data distributions and show that regularity improves with increasing perturbation radius.

We propose a new notion of uniqueness for the adversarial Bayes classifier in the setting of binary classification. Analyzing this concept produces a simple procedure for computing all adversarial Bayes classifiers for a well-motivated family of one dimensional data distributions. This characterization is then leveraged to show that as the perturbation radius increases, certain notions of regularity for the adversarial Bayes classifiers improve. Furthermore, these results provide tools for understanding relationships between the Bayes and adversarial Bayes classifiers in one dimension.

View on arXiv PDF

Similar