LGSep 4, 2024

ForeCal: Random Forest-based Calibration for DNNs

arXiv:2409.02446v1

AI Analysis

This addresses calibration issues in DNNs for applications requiring reliable probability estimates, though it is an incremental improvement over existing post-hoc methods.

The paper tackled the problem of miscalibrated outputs in deep neural network classifiers by proposing ForeCal, a random forest-based post-hoc calibration algorithm, which outperformed existing methods on 43 datasets with lower Expected Calibration Error while maintaining discriminative power.

Deep neural network(DNN) based classifiers do extremely well in discriminating between observations, resulting in higher ROC AUC and accuracy metrics, but their outputs are often miscalibrated with respect to true event likelihoods. Post-hoc calibration algorithms are often used to calibrate the outputs of these classifiers. Methods like Isotonic regression, Platt scaling, and Temperature scaling have been shown to be effective in some cases but are limited by their parametric assumptions and/or their inability to capture complex non-linear relationships. We propose ForeCal - a novel post-hoc calibration algorithm based on Random forests. ForeCal exploits two unique properties of Random forests: the ability to enforce weak monotonicity and range-preservation. It is more powerful in achieving calibration than current state-of-the-art methods, is non-parametric, and can incorporate exogenous information as features to learn a better calibration function. Through experiments on 43 diverse datasets from the UCI ML repository, we show that ForeCal outperforms existing methods in terms of Expected Calibration Error(ECE) with minimal impact on the discriminative power of the base DNN as measured by AUC.

View on arXiv PDF

Similar