LGSep 26, 2025

Teaching Transformers to Solve Combinatorial Problems through Efficient Trial & Error

arXiv:2509.22023v12 citationsh-index: 7
Originality Incremental advance
AI Analysis

This addresses a gap in LLMs' ability to handle NP-class problems like Sudoku, offering a domain-specific improvement.

The paper tackles the challenge of enabling Large Language Models to solve combinatorial problems, specifically Sudoku, achieving state-of-the-art accuracy of 99% compared to prior neuro-symbolic approaches.

Despite their proficiency in various language tasks, Large Language Models (LLMs) struggle with combinatorial problems like Satisfiability, Traveling Salesman Problem, or even basic arithmetic. We address this gap through a novel approach for solving problems in the class NP. We focus on the paradigmatic task of Sudoku and achieve state-of-the-art accuracy (99\%) compared to prior neuro-symbolic approaches. Unlike prior work that used custom architectures, our method employs a vanilla decoder-only Transformer (GPT-2) without external tools or function calling. Our method integrates imitation learning of simple Sudoku rules with an explicit Depth-First Search (DFS) exploration strategy involving informed guessing and backtracking. Moving beyond imitation learning, we seek to minimize the number of guesses until reaching a solution. We provide a rigorous analysis of this setup formalizing its connection to a contextual variant of Min-Sum Set Cover, a well-studied problem in algorithms and stochastic optimization.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes