$L_2$-Regularized Empirical Risk Minimization Guarantees Small Smooth Calibration Error
This provides a foundational theoretical guarantee for calibration in ML, addressing reliability for users of probabilistic predictions.
The paper tackles the problem of ensuring calibration in machine learning models by proving that standard $L_2$-regularized empirical risk minimization directly controls smooth calibration error without post-hoc methods, with theoretical bounds and experimental validation.
Calibration of predicted probabilities is critical for reliable machine learning, yet it is poorly understood how standard training procedures yield well-calibrated models. This work provides the first theoretical proof that canonical $L_{2}$-regularized empirical risk minimization directly controls the smooth calibration error (smCE) without post-hoc correction or specialized calibration-promoting regularizer. We establish finite-sample generalization bounds for smCE based on optimization error, regularization strength, and the Rademacher complexity. We then instantiate this theory for models in reproducing kernel Hilbert spaces, deriving concrete guarantees for kernel ridge and logistic regression. Our experiments confirm these specific guarantees, demonstrating that $L_{2}$-regularized ERM can provide a well-calibrated model without boosting or post-hoc recalibration. The source code to reproduce all experiments is available at https://github.com/msfuji0211/erm_calibration.