CLMar 1

Reasoning or Rationalization? The Role of Justifications in Masked Diffusion Models for Fact Verification

arXiv:2603.01190v1h-index: 4
Originality Incremental advance
AI Analysis

This work addresses the problem of understanding reasoning dynamics in diffusion models for fact verification, revealing potential pitfalls in justification strategies, but it is incremental as it builds on existing models and tasks.

The study investigated whether justifications in Masked Diffusion Language Models for fact verification serve as genuine reasoning or post-hoc rationalization, finding that these models converge on verdicts early and that forcing reasoning-first constraints degrades accuracy from 86.2% to 71.9% due to inconsistencies from justification tokens.

Unlike autoregressive models, which generate tokens sequentially and benefit from reasoning-before-answering strategies such as Chain-of-Thought, Masked Diffusion Language Models (MDLMs) refine all sequence positions simultaneously, raising questions about how these models handle tasks requiring justified verdicts. In this work, we investigate the dynamics of MDLM reasoning on fact verification, examining whether justifications serve as genuine reasoning or post-hoc rationalization. We observe that MDLMs typically converge on a verdict early in the diffusion process, treating it as a global anchor that is resolved before the justification is complete. Crucially, enforcing a reasoning-first constraint via delayed verdict unmasking actively degrades performance, dropping accuracy from 86.2% to 71.9% as accumulating justification tokens introduce inconsistencies that override initially correct predictions. Interventional experiments reveal that the model rationalizes incorrect forced verdicts in 56% of cases, and that verdicts are strongly causally dependent on justification quality (57.3% accuracy with corrupted justifications vs. 97.1% with ground-truth). This causal dependence explains the degradation under forced deliberation: as the model generates noisy justification tokens, it conditions on them, gradually overriding its initially correct assessment. Our findings suggest that for fact verification with MDLMs, extended deliberation can be counterproductive, risking the dilution of accurate early predictions with noise introduced during justification generation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes