ML CR LGMay 19, 2024

Certified Robust Accuracy of Neural Networks Are Bounded due to Bayes Errors

arXiv:2405.11547v212.08 citationsh-index: 42CAV

Originality Incremental advance

AI Analysis

This work addresses a fundamental limitation in adversarial robustness for neural networks, which is crucial for security-critical systems, by providing a theoretical explanation for why accuracy drops when pursuing robustness, though it is incremental as it builds on existing certified training frameworks.

The paper tackles the problem of the trade-off between certified robustness and accuracy in neural networks by analyzing it through the lens of Bayes errors, establishing an upper bound for certified robust accuracy that explains the limited success of existing methods, with empirical results showing an upper bound of 67.49% for CIFAR10 while current approaches only reach 62.84%.

Adversarial examples pose a security threat to many critical systems built on neural networks. While certified training improves robustness, it also decreases accuracy noticeably. Despite various proposals for addressing this issue, the significant accuracy drop remains. More importantly, it is not clear whether there is a certain fundamental limit on achieving robustness whilst maintaining accuracy. In this work, we offer a novel perspective based on Bayes errors. By adopting Bayes error to robustness analysis, we investigate the limit of certified robust accuracy, taking into account data distribution uncertainties. We first show that the accuracy inevitably decreases in the pursuit of robustness due to changed Bayes error in the altered data distribution. Subsequently, we establish an upper bound for certified robust accuracy, considering the distribution of individual classes and their boundaries. Our theoretical results are empirically evaluated on real-world datasets and are shown to be consistent with the limited success of existing certified training results, e.g., for CIFAR10, our analysis results in an upper bound (of certified robust accuracy) of 67.49\%, meanwhile existing approaches are only able to increase it from 53.89\% in 2017 to 62.84\% in 2023.

View on arXiv PDF

Similar