LG MLApr 8

Towards Accurate and Calibrated Classification: Regularizing Cross-Entropy From A Generative Perspective

arXiv:2604.0668920.7h-index: 2

AI Analysis

This addresses the trade-off between calibration and predictive performance in classification for machine learning practitioners, offering an incremental improvement over existing methods.

The paper tackled the problem of overconfident deep neural networks by proposing Generative Cross-Entropy (GCE), which improved both accuracy and calibration across datasets like CIFAR-10/100 and Tiny-ImageNet, especially in long-tailed scenarios, and achieved competitive calibration without sacrificing accuracy when combined with adaptive temperature scaling.

Accurate classification requires not only high predictive accuracy but also well-calibrated confidence estimates. Yet, modern deep neural networks (DNNs) are often overconfident, primarily due to overfitting on the negative log-likelihood (NLL). While focal loss variants alleviate this issue, they typically reduce accuracy, revealing a persistent trade-off between calibration and predictive performance. Motivated by the complementary strengths of generative and discriminative classifiers, we propose Generative Cross-Entropy (GCE), which maximizes $p(x|y)$ and is equivalent to cross-entropy augmented with a class-level confidence regularizer. Under mild conditions, GCE is strictly proper. Across CIFAR-10/100, Tiny-ImageNet, and a medical imaging benchmark, GCE improves both accuracy and calibration over cross-entropy, especially in the long-tailed scenario. Combined with adaptive piecewise temperature scaling (ATS), GCE attains calibration competitive with focal-loss variants without sacrificing accuracy.

View on arXiv PDF

Similar