AIJan 17, 2025

A Survey on LLM Test-Time Compute via Search: Tasks, LLM Profiling, Search Algorithms, and Relevant Frameworks

arXiv:2501.10069v419 citationsh-index: 5Has CodeTrans. Mach. Learn. Res.
Originality Synthesis-oriented
AI Analysis

This work addresses the problem of fragmented comparisons in LLM inference research for researchers, but it is incremental as it synthesizes existing methods without introducing new techniques.

This survey tackles the challenge of comparing LLM test-time compute frameworks by unifying task definitions under Markov Decision Processes and providing modular definitions for LLM profiling and search procedures, enabling precise comparisons and highlighting departures from conventional algorithms.

LLM test-time compute (or LLM inference) via search has emerged as a promising research area with rapid developments. However, current frameworks often adopt distinct perspectives on three key aspects: task definition, LLM profiling, and search procedures, making direct comparisons challenging. Moreover, the search algorithms employed often diverge from standard implementations, and their specific characteristics are not thoroughly specified. This survey aims to provide a comprehensive but integrated technical review on existing LIS frameworks. Specifically, we unify task definitions under Markov Decision Process (MDP) and provides modular definitions of LLM profiling and search procedures. The definitions enable precise comparisons of various LLM inference frameworks while highlighting their departures from conventional search algorithms. We also discuss the applicability, performance, and efficiency of these methods. For ongoing paper updates, please refer to our GitHub repository: https://github.com/xinzhel/LLM-Search.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes