CEAICLOct 15, 2025

Program of Thoughts for Financial Reasoning: Leveraging Dynamic In-Context Examples and Generative Retrieval

arXiv:2510.13157v14 citationsh-index: 6EMNLP
Originality Incremental advance
AI Analysis

This work addresses financial numerical reasoning problems for users in finance and AI, representing an incremental advancement over existing methods.

The paper tackles the challenge of numerical reasoning in large language models (LLMs) for financial tasks, introducing FINDER, a two-step framework that achieves state-of-the-art performance with execution accuracy improvements of 5.98% on FinQA and 4.05% on ConvFinQA datasets.

Despite continuous advancements in the capabilities of large language models (LLMs), numerical reasoning remains a challenging area. Techniques like chain-of-thought prompting, tree-of-thought prompting, and program-of-thought prompting guide LLMs through intermediate reasoning steps. Although in-context learning with few-shot prompting has improved performance, LLMs still lag behind state-of-the-art models on financial numerical reasoning datasets such as FinQA and ConvFinQA. In this work, we introduce FINDER, a novel two-step framework, to enhance LLMs' capabilities in financial numerical reasoning. The first step utilizes a generative retriever to extract relevant facts from unstructured data, including both text and tables. This is followed by context-aware Program of Thought prompting with dynamic selection of in-context examples. Our model FINDER achieves a new state-of-the-art performance on both the FinQA and ConvFinQA datasets, surpassing previous benchmarks with execution accuracy improvements of 5.98% and 4.05%, respectively.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes