AIAO-PHPLASM-PHAug 19, 2024

Uncertainty Quantification of Surrogate Models using Conformal Prediction

arXiv:2408.09881v311 citationsh-index: 48
Originality Highly original
AI Analysis

This addresses the need for trustworthy deployment of machine learning in safety-critical physical science applications where traditional uncertainty quantification methods struggle with computational costs and high-dimensional spatio-temporal problems.

The paper tackles the problem of uncertainty quantification for data-driven surrogate models, which typically lack reliability guarantees, by presenting a conformal prediction framework that provides statistically guaranteed marginal coverage with near-zero computational cost. The method achieves empirical coverage with valid error bars across diverse applications including fluid dynamics and weather forecasting, requiring only seconds to minutes for calibration on standard hardware.

Data-driven surrogate models offer quick approximations to complex numerical and experimental systems but typically lack uncertainty quantification, limiting their reliability in safety-critical applications. While Bayesian methods provide uncertainty estimates, they offer no statistical guarantees and struggle with high-dimensional spatio-temporal problems due to computational costs. We present a conformal prediction (CP) framework that provides statistically guaranteed marginal coverage for surrogate models in a model-agnostic manner with near-zero computational cost. Our approach handles high-dimensional spatio-temporal outputs by performing cell-wise calibration while preserving the tensorial structure of predictions. Through extensive empirical evaluation across diverse applications including fluid dynamics, magnetohydrodynamics, weather forecasting, and fusion diagnostics, we demonstrate that CP achieves empirical coverage with valid error bars regardless of model architecture, training regime, or output dimensionality. We evaluate three nonconformity scores (conformalised quantile regression, absolute error residual, and standard deviation) for both deterministic and probabilistic models, showing that guaranteed coverage holds even for out-of-distribution predictions where models are deployed on physics regimes different from training data. Calibration requires only seconds to minutes on standard hardware. The framework enables rigorous validation of pre-trained surrogate models for downstream applications without retraining. While CP provides marginal rather than conditional coverage and assumes exchangeability between calibration and test data, our method circumvents the curse of dimensionality inherent in traditional uncertainty quantification approaches, offering a practical tool for trustworthy deployment of machine learning in physical sciences.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes