AIFeb 22, 2025

Dynamic Parallel Tree Search for Efficient LLM Reasoning

arXiv:2502.16235v241 citationsh-index: 19ACL
Originality Incremental advance
AI Analysis

This addresses the problem of slow reasoning for users of large language models, offering an incremental improvement in efficiency.

The paper tackled the computational inefficiency of Tree of Thoughts (ToT) methods in LLM reasoning by proposing Dynamic Parallel Tree Search (DPTS), which improved efficiency by 2-4x on average while maintaining or surpassing accuracy on Math500 and GSM8K datasets.

Tree of Thoughts (ToT) enhances Large Language Model (LLM) reasoning by structuring problem-solving as a spanning tree. However, recent methods focus on search accuracy while overlooking computational efficiency. The challenges of accelerating the ToT lie in the frequent switching of reasoning focus, and the redundant exploration of suboptimal solutions. To alleviate this dilemma, we propose Dynamic Parallel Tree Search (DPTS), a novel parallelism framework that aims to dynamically optimize the reasoning path in inference. It includes the Parallelism Streamline in the generation phase to build up a flexible and adaptive parallelism with arbitrary paths by fine-grained cache management and alignment. Meanwhile, the Search and Transition Mechanism filters potential candidates to dynamically maintain the reasoning focus on more possible solutions and have less redundancy. Experiments on Qwen-2.5 and Llama-3 with Math500 and GSM8K datasets show that DPTS significantly improves efficiency by 2-4x on average while maintaining or even surpassing existing reasoning algorithms in accuracy, making ToT-based reasoning more scalable and computationally efficient.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes