AI CLMay 20, 2025

Reasoning Models Better Express Their Confidence

Dongkeun Yoon, Seungone Kim, Sohee Yang, Sunkyoung Kim, Soyeon Kim, Yongil Kim, Eunbi Choi, Yireun Kim, Minjoon Seo

CMUDeepMind

arXiv:2505.14489v231.654 citationsh-index: 21Has Code

Originality Incremental advance

AI Analysis

This addresses the reliability issue for users of LLMs by improving confidence calibration, though it is incremental as it builds on existing reasoning methods.

The paper tackles the problem of large language models (LLMs) inaccurately expressing confidence, limiting reliability, and finds that reasoning models with chain-of-thought (CoT) reasoning achieve strictly better confidence calibration than non-reasoning models in 33 out of 36 settings, with gains linked to slow thinking behaviors.

Despite their strengths, large language models (LLMs) often fail to communicate their confidence accurately, making it difficult to assess when they might be wrong and limiting their reliability. In this work, we demonstrate that reasoning models that engage in extended chain-of-thought (CoT) reasoning exhibit superior performance not only in problem-solving but also in accurately expressing their confidence. Specifically, we benchmark six reasoning models across six datasets and find that they achieve strictly better confidence calibration than their non-reasoning counterparts in 33 out of the 36 settings. Our detailed analysis reveals that these gains in calibration stem from the slow thinking behaviors of reasoning models (e.g., exploring alternative approaches and backtracking) which enable them to adjust their confidence dynamically throughout their CoT, making it progressively more accurate. In particular, we find that reasoning models become increasingly better calibrated as their CoT unfolds, a trend not observed in non-reasoning models. Moreover, removing slow thinking behaviors from the CoT leads to a significant drop in calibration. Lastly, we show that non-reasoning models also demonstrate enhanced calibration when simply guided to slow think via in-context learning, fully isolating slow thinking as the source of the calibration gains.

View on arXiv PDF Code

Similar