CLDec 17, 2024

Refining Answer Distributions for Improved Large Language Model Reasoning

arXiv:2412.13292v23 citationsh-index: 9
Originality Incremental advance
AI Analysis

This work addresses a bottleneck in enhancing LLM reasoning for AI applications, though it appears incremental as it builds on existing combination strategies.

The paper tackles the problem of inefficient combination of multiple LLM responses for reasoning tasks by introducing Refined Answer Distributions, an iterative sampling framework that improves reasoning performance on benchmarks.

Large Language Models (LLMs) have exhibited an impressive capability to perform reasoning tasks, especially if they are encouraged to generate a sequence of intermediate steps. Reasoning performance can be improved by suitably combining multiple LLM responses, generated either in parallel in a single query, or via sequential interactions with LLMs throughout the reasoning process. Existing strategies for combination, such as self-consistency and progressive-hint-prompting, make inefficient usage of the LLM responses. We present Refined Answer Distributions, a novel and principled algorithmic framework to enhance the reasoning capabilities of LLMs. Our approach can be viewed as an iterative sampling strategy for forming a Monte Carlo approximation of an underlying distribution of answers, with the goal of identifying the mode -- the most likely answer. Empirical evaluation on several reasoning benchmarks demonstrates the superiority of the proposed approach.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes