LG CR MLMay 23, 2023

Expressive Losses for Verified Robustness via Convex Combinations

Alessandro De Palma, Rudy Bunel, Krishnamurthy Dvijotham, M. Pawan Kumar, Robert Stanforth, Alessio Lomuscio

arXiv:2305.13991v317.029 citationsHas Code

Originality Incremental advance

AI Analysis

This addresses the problem of balancing accuracy and robustness for practitioners in adversarial machine learning, though it is incremental as it builds on existing methods like adversarial training and IBP bounds.

The paper tackles the trade-off between standard accuracy and verified adversarial robustness in neural networks by proposing expressive loss functions that span a range of trade-offs via a single parameter, achieving state-of-the-art results across various settings with conceptually simple convex combinations.

In order to train networks for verified adversarial robustness, it is common to over-approximate the worst-case loss over perturbation regions, resulting in networks that attain verifiability at the expense of standard performance. As shown in recent work, better trade-offs between accuracy and robustness can be obtained by carefully coupling adversarial training with over-approximations. We hypothesize that the expressivity of a loss function, which we formalize as the ability to span a range of trade-offs between lower and upper bounds to the worst-case loss through a single parameter (the over-approximation coefficient), is key to attaining state-of-the-art performance. To support our hypothesis, we show that trivial expressive losses, obtained via convex combinations between adversarial attacks and IBP bounds, yield state-of-the-art results across a variety of settings in spite of their conceptual simplicity. We provide a detailed analysis of the relationship between the over-approximation coefficient and performance profiles across different expressive losses, showing that, while expressivity is essential, better approximations of the worst-case loss are not necessarily linked to superior robustness-accuracy trade-offs.

View on arXiv PDF Code

Similar