CPAI
Aug 10, 2025

Can LLMs Identify Tax Abuse?

arXiv:2508.20097v1
h-index: 15
Originality: Synthesis-oriented
AI Analysis

This paper addresses the challenge of reducing tax revenue lost to well-advised, wealthy taxpayers, but its contribution is incremental: it applies existing LLM methods to a new domain.

The paper asks whether large language models can discover and analyze U.S. tax-minimization strategies, and reports that LLM-based reasoning identified an entirely novel tax strategy.

We investigate whether large language models can discover and analyze U.S. tax-minimization strategies. This real-world domain challenges even seasoned human experts, and progress can reduce tax revenue lost from well-advised, wealthy taxpayers. We evaluate the most advanced LLMs on their ability to (1) interpret and verify tax strategies, (2) fill in gaps in partially specified strategies, and (3) generate complete, end-to-end strategies from scratch. This domain should be of particular interest to the LLM reasoning community: unlike synthetic challenge problems or scientific reasoning tasks, U.S. tax law involves navigating hundreds of thousands of pages of statutes, case law, and administrative guidance, all updated regularly. Notably, LLM-based reasoning identified an entirely novel tax strategy, highlighting these models' potential to revolutionize tax agencies' fight against tax abuse.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.
