LGFeb 3, 2022

Certifying Out-of-Domain Generalization for Blackbox Functions

Maurice Weber, Linyi Li, Boxin Wang, Zhikuan Zhao, Bo Li, Ce Zhang

arXiv:2202.01679v210.420 citationsh-index: 32Has Code

Originality Highly original

AI Analysis

This addresses the challenge of ensuring model reliability under data distribution shifts for practitioners using large-scale deep learning, offering a scalable and flexible solution beyond existing limited techniques.

The paper tackles the problem of certifying distributional robustness for blackbox models and bounded loss functions, proposing a novel certification framework based on the Hellinger distance that scales to ImageNet-scale datasets and complex models, achieving non-vacuous certified out-of-domain generalization on ImageNet and outperforming state-of-the-art methods on smaller tasks.

Certifying the robustness of model performance under bounded data distribution drifts has recently attracted intensive interest under the umbrella of distributional robustness. However, existing techniques either make strong assumptions on the model class and loss functions that can be certified, such as smoothness expressed via Lipschitz continuity of gradients, or require to solve complex optimization problems. As a result, the wider application of these techniques is currently limited by its scalability and flexibility -- these techniques often do not scale to large-scale datasets with modern deep neural networks or cannot handle loss functions which may be non-smooth such as the 0-1 loss. In this paper, we focus on the problem of certifying distributional robustness for blackbox models and bounded loss functions, and propose a novel certification framework based on the Hellinger distance. Our certification technique scales to ImageNet-scale datasets, complex models, and a diverse set of loss functions. We then focus on one specific application enabled by such scalability and flexibility, i.e., certifying out-of-domain generalization for large neural networks and loss functions such as accuracy and AUC. We experimentally validate our certification method on a number of datasets, ranging from ImageNet, where we provide the first non-vacuous certified out-of-domain generalization, to smaller classification tasks where we are able to compare with the state-of-the-art and show that our method performs considerably better.

View on arXiv PDF Code

Similar