Conformal Approach To Gaussian Process Surrogate Evaluation With Coverage Guarantees
This provides a robust tool for evaluating GP surrogate models in industrial applications like nuclear reactor simulation, though it is incremental as it adapts existing conformal prediction to GPs.
The authors tackled the issue of unreliable Bayesian credibility intervals in Gaussian process surrogate models by proposing a conformal prediction method that weights non-conformity scores with posterior standard deviation, resulting in adaptive intervals with frequentist coverage guarantees and significant correlation with local approximation error.
Gaussian processes (GPs) are a Bayesian machine learning approach widely used to construct surrogate models for the uncertainty quantification of computer simulation codes in industrial applications. It provides both a mean predictor and an estimate of the posterior prediction variance, the latter being used to produce Bayesian credibility intervals. Interpreting these intervals relies on the Gaussianity of the simulation model as well as the well-specification of the priors which are not always appropriate. We propose to address this issue with the help of conformal prediction. In the present work, a method for building adaptive cross-conformal prediction intervals is proposed by weighting the non-conformity score with the posterior standard deviation of the GP. The resulting conformal prediction intervals exhibit a level of adaptivity akin to Bayesian credibility sets and display a significant correlation with the surrogate model local approximation error, while being free from the underlying model assumptions and having frequentist coverage guarantees. These estimators can thus be used for evaluating the quality of a GP surrogate model and can assist a decision-maker in the choice of the best prior for the specific application of the GP. The performance of the method is illustrated through a panel of numerical examples based on various reference databases. Moreover, the potential applicability of the method is demonstrated in the context of surrogate modeling of an expensive-to-evaluate simulator of the clogging phenomenon in steam generators of nuclear reactors.