AILGOct 4, 2025

GuidedSampling: Steering LLMs Towards Diverse Candidate Solutions at Inference-Time

arXiv:2510.03777v11 citationsh-index: 16
Originality Incremental advance
AI Analysis

This addresses the issue of redundant candidate solutions in LLM inference for complex tasks, offering a method to enhance diversity and performance, though it is incremental as it builds on existing RS approaches.

The paper tackled the problem of limited diversity in Repeated Sampling (RS) inference algorithms for LLMs by proposing GuidedSampling, which decouples exploration and generation phases to increase solution diversity, resulting in an average ~21.6% improvement in pass@50 performance and ~9.7% improvement in pass@5 for trained models.

Repeated Sampling (RS) is a simple inference-time algorithm that has been shown to improve model performance on complex tasks. Although it is an effective way of scaling inference time, it often struggles to generate diverse solution candidates, frequently relying on the same underlying approach to solve the problem and thus producing redundant samples. To address this limitation, we propose a new inference algorithm, GuidedSampling, which decouples the exploration and generation phases during inference, increasing diversity of generated candidate solutions. The exploration phase identifies multiple concepts that can be utilized to solve the problem, while the generation phase applies a specific concept to provide final solution candidates. We first define the theoretical bounds of GuidedSampling and then empirically demonstrate that it improves the performance of base model at pass@50 by on an average ~21.6% across various benchmarks compared to RS. Furthermore, models trained on trajectories of GuidedSampling exhibit substantial performance improvements at pass@5 by on an average ~9.7%, compared to models trained on traditional RS. Additionally, models trained with GuidedSampling increases the average number of concepts per instance (1.67 -> 3.03), yielding a diverse set of candidates than traditional RS.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes