LGOct 19, 2025

SolverLLM: Leveraging Test-Time Scaling for Optimization Problem via LLM-Guided Search

Dong Li, Xujiang Zhao, Linlin Yu, Yanchi Liu, Wei Cheng, Zhengzhang Chen, Zhong Chen, Feng Chen, Chen Zhao, Haifeng Chen

arXiv:2510.16916v214.49 citationsh-index: 12

Originality Incremental advance

AI Analysis

This addresses the need for efficient and generalizable optimization solvers in AI applications, though it is incremental as it builds on existing MCTS methods with novel modifications.

The paper tackles the problem of solving diverse optimization problems using LLMs without costly training, by introducing SolverLLM, a training-free framework that generates mathematical formulations and solver-ready code via a modified MCTS strategy, achieving strong generalization and outperforming baselines on six benchmark datasets.

Large Language Models (LLMs) offer promising capabilities for tackling complex reasoning tasks, including optimization problems. However, existing methods either rely on prompt engineering, which leads to poor generalization across problem types, or require costly supervised training. We introduce SolverLLM, a training-free framework that leverages test-time scaling to solve diverse optimization problems. Rather than solving directly, SolverLLM generates mathematical formulations and translates them into solver-ready code, guided by a novel Monte Carlo Tree Search (MCTS) strategy. To enhance the search process, we modify classical MCTS with (1) dynamic expansion for adaptive formulation generation, (2) prompt backpropagation to guide exploration via outcome-driven feedback, and (3) uncertainty backpropagation to incorporate reward reliability into decision-making. Experiments on six standard benchmark datasets demonstrate that SolverLLM outperforms both prompt-based and learning-based baselines, achieving strong generalization without additional training.

View on arXiv PDF

Similar