LG MLOct 14, 2024

Towards Calibrated Losses for Adversarial Robust Reject Option Classification

Vrund Shah, Tejas Chaudhari, Naresh Manwani

arXiv:2410.10736v14.61 citationsh-index: 12Has CodeACML

Originality Synthesis-oriented

AI Analysis

This work addresses the need for reliable classifiers in high-stakes applications like autonomous driving and medical diagnosis by enabling abstention under adversarial perturbations, though it is incremental as it builds on existing adversarial robustness and reject option frameworks.

The paper tackles the problem of designing calibrated surrogate losses for adversarial robust reject option classification, where classifiers can abstain from predictions under adversarial attacks, and demonstrates that shifted Double Ramp Loss and shifted Double Sigmoid Loss satisfy calibration conditions and show robustness on synthetic data.

Robustness towards adversarial attacks is a vital property for classifiers in several applications such as autonomous driving, medical diagnosis, etc. Also, in such scenarios, where the cost of misclassification is very high, knowing when to abstain from prediction becomes crucial. A natural question is which surrogates can be used to ensure learning in scenarios where the input points are adversarially perturbed and the classifier can abstain from prediction? This paper aims to characterize and design surrogates calibrated in "Adversarial Robust Reject Option" setting. First, we propose an adversarial robust reject option loss $\ell_{d}^γ$ and analyze it for the hypothesis set of linear classifiers ($\mathcal{H}_{\textrm{lin}}$). Next, we provide a complete characterization result for any surrogate to be $(\ell_{d}^γ,\mathcal{H}_{\textrm{lin}})$- calibrated. To demonstrate the difficulty in designing surrogates to $\ell_{d}^γ$, we show negative calibration results for convex surrogates and quasi-concave conditional risk cases (these gave positive calibration in adversarial setting without reject option). We also empirically argue that Shifted Double Ramp Loss (DRL) and Shifted Double Sigmoid Loss (DSL) satisfy the calibration conditions. Finally, we demonstrate the robustness of shifted DRL and shifted DSL against adversarial perturbations on a synthetically generated dataset.

View on arXiv PDF Code

Similar