MLLGCOJan 24, 2025

coverforest: Conformal Predictions with Random Forest in Python

arXiv:2501.14570v21 citationsh-index: 2Has CodeNeurocomputing
Originality Synthesis-oriented
AI Analysis

This work provides a practical tool for researchers and practitioners in machine learning who need efficient uncertainty quantification with conformal predictions, though it is incremental as it builds on existing methods with optimizations.

The paper tackles the computational inefficiency of cross-conformal prediction methods like CV+ and Jackknife+-after-bootstrap by developing coverforest, a Python package optimized for random forests, resulting in training and prediction times that are 2–9 times faster than existing implementations while maintaining desired coverage levels.

Conformal prediction provides a framework for uncertainty quantification, specifically in the forms of prediction intervals and sets with distribution-free guaranteed coverage. While recent cross-conformal techniques such as CV+ and Jackknife+-after-bootstrap achieve better data efficiency than traditional split conformal methods, they incur substantial computational costs due to required pairwise comparisons between training and test samples' out-of-bag scores. Observing that these methods naturally extend from ensemble models, particularly random forests, we leverage existing optimized random forest implementations to enable efficient cross-conformal predictions. We present coverforest, a Python package that implements efficient conformal prediction methods specifically optimized for random forests. coverforest supports both regression and classification tasks through various conformal prediction methods, including split conformal, CV+, Jackknife+-after-bootstrap, and adaptive prediction sets. Our package leverages parallel computing and Cython optimizations to speed up out-of-bag calculations. Our experiments demonstrate that coverforest's predictions achieve the desired level of coverage. In addition, its training and prediction times can be faster than an existing implementation by 2--9 times. The source code for the coverforest is hosted on GitHub at https://github.com/donlap/coverforest.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes