ML LG STJul 24, 2020

Cross-validation Confidence Intervals for Test Error

Pierre Bayle, Alexandre Bayle, Lucas Janson, Lester Mackey

arXiv:2007.12671v215.564 citationsHas Code

Originality Incremental advance

AI Analysis

This provides a practical solution for researchers and practitioners needing reliable statistical inference in model evaluation, though it is incremental as it builds on existing cross-validation methods.

The paper tackles the problem of estimating confidence intervals for cross-validation test error by developing central limit theorems and consistent variance estimators under weak stability conditions, resulting in asymptotically-exact confidence intervals and hypothesis tests that outperform popular alternatives in real-data experiments.

This work develops central limit theorems for cross-validation and consistent estimators of its asymptotic variance under weak stability conditions on the learning algorithm. Together, these results provide practical, asymptotically-exact confidence intervals for $k$-fold test error and valid, powerful hypothesis tests of whether one learning algorithm has smaller $k$-fold test error than another. These results are also the first of their kind for the popular choice of leave-one-out cross-validation. In our real-data experiments with diverse learning algorithms, the resulting intervals and tests outperform the most popular alternative methods from the literature.

View on arXiv PDF Code

Similar