CRAFT: A Unified Counterfactual Reasoning Framework for Tabular Question Answering and Fact Verification
For researchers working on table reasoning with LLMs, CRAFT provides a unified framework that improves multi-step inference accuracy and model robustness.
CRAFT reformulates tabular QA and fact verification into a bidirectional verification process using counterfactual reasoning, achieving consistent improvements over baselines on WikiTQ and TabFact, especially on complex questions, and reducing performance gaps across LLMs.
Table reasoning remains challenging for large language models (LLMs), particularly in tasks that require multi-step inference over long and structured tables. Existing approaches predominantly rely on single-direction reasoning, which limits their ability to explore alternative hypotheses across tasks. In this work, we propose CRAFT, a unified Counterfactual Reasoning Framework that reformulates Tabular question answering and fact verification into a general bidirectional verification process. Our method explicitly constructs both declarative statements and their counterfactual variants. Evidence is then extracted from reasoning along both the original and counterfactual paths, and integrated via a weighted mechanism to arrive at the final answer. Experimental results show that our approach consistently surpasses representative baselines on table reasoning datasets such as WikiTQ and TabFact, achieving especially large improvements on complex question answering. Our framework also significantly mitigates performance gaps between different backbone LLMs. This indicates that counterfactual reasoning effectively overcomes the limitations of single-direction inference, guiding LLMs toward more discerning reasoning and establishing a more principled paradigm for structured reasoning tasks. Our code will be made publicly available upon acceptance.