CL, Feb 20, 2024

FormulaReasoning: A Dataset for Formula-Based Numerical Reasoning

arXiv:2402.12692v6, 1 citation, h-index: 2
Originality: Synthesis-oriented
AI Analysis

This dataset addresses a gap for researchers in AI and education by enabling better evaluation of formula-based reasoning methods, though it is incremental as it builds on existing numerical reasoning work.

The authors tackled the lack of explicit formula use in numerical reasoning datasets by introducing FormulaReasoning, a dataset of 5,324 questions requiring physics-based calculations, with detailed annotations in English and Chinese.

The application of formulas (e.g., physics formulas) is a fundamental human ability in solving numerical reasoning problems. Existing numerical reasoning datasets rarely explicitly state the formulas employed, as their questions often rely on implicit commonsense mathematical knowledge. To address this gap, we introduce FormulaReasoning, a new dataset specifically designed for formula-based numerical reasoning. It consists of 5,324 questions that require numerical calculations grounded in external physics formulas. We provide normalized, fine-grained annotations in both English and Chinese, including formula structures, parameter names, symbols, numerical values, and units, curated through extensive manual effort with LLM-assisted validation to ensure high quality. Additionally, we offer a consolidated formula database to serve as an external knowledge source. We analyze various reasoning approaches on FormulaReasoning, with emphasis on comparative evaluation of different architectural and methodological frameworks. Our assessment includes retrieval-augmented methods, approaches that decompose reasoning into formula generation, parameter extraction, and numerical calculation, as well as optimization techniques using preference data. We identify key challenges in formula-based numerical reasoning that require further investigation across different reasoning paradigms, highlighting opportunities for methodological advancement.
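The decomposition mentioned in the abstract (formula generation, parameter extraction, numerical calculation) can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration: the `Formula` record, the `FORMULA_DB` contents, the `solve` helper, and the example values are hypothetical and do not reflect the dataset's actual schema or the authors' code. In a real pipeline a model or retriever would choose the formula and extract the parameters from the question text; here both are supplied by hand.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Tuple


@dataclass(frozen=True)
class Formula:
    """Hypothetical record for one entry of a formula database."""
    expression: str               # human-readable formula structure
    inputs: Tuple[str, ...]       # symbols of the required parameters
    output: str                   # symbol of the computed quantity
    unit: str                     # unit of the computed quantity
    compute: Callable[..., float]


# Illustrative database with a single standard physics formula.
FORMULA_DB: Dict[str, Formula] = {
    "heat_absorbed": Formula(
        expression="Q = c * m * dT",
        inputs=("c", "m", "dT"),
        output="Q",
        unit="J",
        compute=lambda c, m, dT: c * m * dT,
    ),
}


def solve(formula_key: str, params: Dict[str, float]) -> Tuple[str, float, str]:
    """Look up a formula, check its parameters, and run the calculation."""
    formula = FORMULA_DB[formula_key]          # step 1: formula selection
    missing = [s for s in formula.inputs if s not in params]
    if missing:                                # step 2: parameter check
        raise KeyError(f"missing parameters: {missing}")
    args = (params[s] for s in formula.inputs)
    value = formula.compute(*args)             # step 3: numerical calculation
    return formula.output, value, formula.unit


# Hand-extracted parameters for a made-up question: heating 2 kg of water
# by 50 degrees C, with c = 4200 J/(kg*degC).
result = solve("heat_absorbed", {"c": 4200.0, "m": 2.0, "dT": 50.0})
print(result)
```

Keeping the formula, its parameter symbols, and its unit in one record mirrors the kinds of annotation the abstract lists, and it makes a missing parameter fail loudly instead of producing a silent wrong number.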
