MLLGJan 31, 2025

Optimal Transport-based Conformal Prediction

arXiv:2501.18991v220 citationsh-index: 10ICML
Originality Highly original
AI Analysis

This addresses the problem of improving uncertainty quantification for multivariate learning tasks in machine learning, offering a more geometrically-aware method compared to existing approaches.

The paper tackles the limitation of traditional conformal prediction methods that use scalar nonconformity scores, which fail to capture geometric structure in multivariate outputs like multi-output regression or multiclass classification. They introduce a novel approach using optimal transport to construct flexible, potentially non-convex prediction regions, proving finite-sample coverage guarantees and demonstrating advantages in coverage and efficiency on practical problems.

Conformal Prediction (CP) is a principled framework for quantifying uncertainty in blackbox learning models, by constructing prediction sets with finite-sample coverage guarantees. Traditional approaches rely on scalar nonconformity scores, which fail to fully exploit the geometric structure of multivariate outputs, such as in multi-output regression or multiclass classification. Recent methods addressing this limitation impose predefined convex shapes for the prediction sets, potentially misaligning with the intrinsic data geometry. We introduce a novel CP procedure handling multivariate score functions through the lens of optimal transport. Specifically, we leverage Monge-Kantorovich vector ranks and quantiles to construct prediction region with flexible, potentially non-convex shapes, better suited to the complex uncertainty patterns encountered in multivariate learning tasks. We prove that our approach ensures finite-sample, distribution-free coverage properties, similar to typical CP methods. We then adapt our method for multi-output regression and multiclass classification, and also propose simple adjustments to generate adaptive prediction regions with asymptotic conditional coverage guarantees. Finally, we evaluate our method on practical regression and classification problems, illustrating its advantages in terms of (conditional) coverage and efficiency.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes