LG AIJun 14, 2025

Unveiling Confirmation Bias in Chain-of-Thought Reasoning

arXiv:2506.12301v119.710 citationsh-index: 4Has CodeACL

Originality Incremental advance

AI Analysis

This study addresses the problem of inconsistent reasoning performance in LLMs for researchers and practitioners, offering insights to improve prompting strategies, though it is incremental in nature.

The paper investigates confirmation bias in chain-of-thought reasoning of large language models, showing that model beliefs skew reasoning generation and answer prediction, and explains variations in CoT effectiveness across tasks and models.

Chain-of-thought (CoT) prompting has been widely adopted to enhance the reasoning capabilities of large language models (LLMs). However, the effectiveness of CoT reasoning is inconsistent across tasks with different reasoning types. This work presents a novel perspective to understand CoT behavior through the lens of \textit{confirmation bias} in cognitive psychology. Specifically, we examine how model internal beliefs, approximated by direct question-answering probabilities, affect both reasoning generation ($Q \to R$) and reasoning-guided answer prediction ($QR \to A$) in CoT. By decomposing CoT into a two-stage process, we conduct a thorough correlation analysis in model beliefs, rationale attributes, and stage-wise performance. Our results provide strong evidence of confirmation bias in LLMs, such that model beliefs not only skew the reasoning process but also influence how rationales are utilized for answer prediction. Furthermore, the interplay between task vulnerability to confirmation bias and the strength of beliefs also provides explanations for CoT effectiveness across reasoning tasks and models. Overall, this study provides a valuable insight for the needs of better prompting strategies that mitigate confirmation bias to enhance reasoning performance. Code is available at \textit{https://github.com/yuewan2/biasedcot}.

View on arXiv PDF Code

Similar