CL · Aug 7, 2025

Learning to Reason for Factuality

Xilun Chen, Ilia Kulikov, Vincent-Pierre Berges, Barlas Oğuz, Rulin Shao, Gargi Ghosh, Jason Weston, Wen-tau Yih

Meta AI

arXiv:2508.05618v1 · 15 citations · h-index: 28

Originality: Incremental advance

AI Analysis

This work addresses hallucinations in reasoning models for applications that require accurate long-form content. It is an incremental improvement over existing methods.

The paper tackles factuality in reasoning large language models, which generate more hallucinations than non-reasoning models. It proposes a novel reward function for online reinforcement learning that targets factual precision, detail, and relevance, and reports an average 23.1 percentage point reduction in hallucination rate and a 23% increase in answer detail across six long-form factuality benchmarks.

Reasoning Large Language Models (R-LLMs) have significantly advanced complex reasoning tasks but often struggle with factuality, generating substantially more hallucinations than their non-reasoning counterparts on long-form factuality benchmarks. However, extending online Reinforcement Learning (RL), a key component in recent R-LLM advancements, to the long-form factuality setting poses several unique challenges due to the lack of reliable verification methods. Previous work has utilized automatic factuality evaluation frameworks such as FActScore to curate preference data in the offline RL setting, yet we find that directly leveraging such methods as the reward in online RL leads to reward hacking in multiple ways, such as producing less detailed or less relevant responses. We propose a novel reward function that simultaneously considers factual precision, response detail level, and answer relevance, and apply online RL to learn high-quality factual reasoning. Evaluated on six long-form factuality benchmarks, our factual reasoning model achieves an average reduction of 23.1 percentage points in hallucination rate, a 23% increase in answer detail level, and no degradation in overall response helpfulness.
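The reward-hacking problem described in the abstract can be illustrated with a toy sketch. It is not the paper's reward function: the `ScoredResponse` container, the log-shaped detail term, and both weights are assumptions chosen only to show why a precision-only reward favors terse or off-topic answers, and how adding detail and relevance terms changes the ranking.

```python
import math
from dataclasses import dataclass


@dataclass
class ScoredResponse:
    """Hypothetical container for verifier outputs on one response."""
    supported_claims: int  # atomic claims judged supported by evidence
    total_claims: int      # all atomic claims extracted from the response
    relevant: bool         # whether the response actually answers the prompt


def precision_only_reward(r: ScoredResponse) -> float:
    """Fraction of supported claims, used directly as the reward."""
    if r.total_claims == 0:
        return 0.0
    return r.supported_claims / r.total_claims


def combined_reward(r: ScoredResponse,
                    detail_weight: float = 0.1,
                    irrelevance_penalty: float = 1.0) -> float:
    """Illustrative reward mixing precision, detail, and relevance.

    The weights and the log-shaped detail term are assumptions made for
    this sketch, not values taken from the paper.
    """
    precision = precision_only_reward(r)
    detail = math.log1p(r.supported_claims)  # diminishing returns on detail
    penalty = 0.0 if r.relevant else irrelevance_penalty
    return precision + detail_weight * detail - penalty


# Three made-up responses to the same prompt.
terse = ScoredResponse(supported_claims=2, total_claims=2, relevant=True)
detailed = ScoredResponse(supported_claims=18, total_claims=20, relevant=True)
off_topic = ScoredResponse(supported_claims=20, total_claims=20, relevant=False)

for name, resp in [("terse", terse), ("detailed", detailed), ("off_topic", off_topic)]:
    print(f"{name:10s} precision_only={precision_only_reward(resp):.2f} "
          f"combined={combined_reward(resp):.2f}")
```

Under these assumed weights, the precision-only reward ranks the two-claim answer above the detailed one and gives the off-topic answer a perfect score, while the combined reward ranks the detailed, relevant answer highest.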
