MLLGSTJul 24, 2020

Cross-validation Confidence Intervals for Test Error

arXiv:2007.12671v264 citations
AI Analysis

This provides a practical solution for researchers and practitioners needing reliable statistical inference in model evaluation, though it is incremental as it builds on existing cross-validation methods.

The paper tackles the problem of estimating confidence intervals for cross-validation test error by developing central limit theorems and consistent variance estimators under weak stability conditions, resulting in asymptotically-exact confidence intervals and hypothesis tests that outperform popular alternatives in real-data experiments.

This work develops central limit theorems for cross-validation and consistent estimators of its asymptotic variance under weak stability conditions on the learning algorithm. Together, these results provide practical, asymptotically-exact confidence intervals for $k$-fold test error and valid, powerful hypothesis tests of whether one learning algorithm has smaller $k$-fold test error than another. These results are also the first of their kind for the popular choice of leave-one-out cross-validation. In our real-data experiments with diverse learning algorithms, the resulting intervals and tests outperform the most popular alternative methods from the literature.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes