CL AI CYAug 28, 2025

Language Models and Logic Programs for Trustworthy Financial Reasoning

William Jurayj, Nils Holzenberger, Benjamin Van Durme

arXiv:2508.21051v26.73 citationsh-index: 15

Originality Incremental advance

AI Analysis

This work addresses the problem of making tax assistance more reliable and accessible for taxpayers, though it is incremental as it builds on existing neuro-symbolic methods.

The authors tackled the problem of automating tax filing, which requires complex reasoning and high accuracy to avoid costly penalties, by integrating large language models with a symbolic solver. Their approach improved performance on the SARA dataset and reduced estimated costs below real-world averages.

According to the United States Internal Revenue Service, ''the average American spends $\$270$ and 13 hours filing their taxes''. Even beyond the U.S., tax filing requires complex reasoning, combining application of overlapping rules with numerical calculations. Because errors can incur costly penalties, any automated system must deliver high accuracy and auditability, making modern large language models (LLMs) poorly suited for this task. We propose an approach that integrates LLMs with a symbolic solver to calculate tax obligations. We evaluate variants of this system on the challenging StAtutory Reasoning Assessment (SARA) dataset, and include a novel method for estimating the cost of deploying such a system based on real-world penalties for tax errors. We further show how combining up-front translation of plain-text rules into formal logic programs, combined with intelligently retrieved exemplars for formal case representations, can dramatically improve performance on this task and reduce costs to well below real-world averages. Our results demonstrate the promise and economic feasibility of neuro-symbolic architectures for increasing equitable access to reliable tax assistance.

View on arXiv PDF

Similar