SEAILGOct 22, 2024

Scattered Forest Search: Smarter Code Space Exploration with LLMs

arXiv:2411.05010v29 citationsh-index: 25
Originality Highly original
AI Analysis

This addresses code generation efficiency for developers and AI systems, representing a strong incremental advance over existing search techniques.

The paper tackles code generation by framing it as a black-box optimization problem and proposes Scattered Forest Search (SFS) to improve solution diversity and feedback exploitation during evolutionary search. The method achieves a pass@1 rate of 67.1% on HumanEval+ and 87.2% on HumanEval with GPT-3.5, marking improvements of 8.6% and 4.3% over state-of-the-art while halving iterations needed.

We frame code generation as a black-box optimization problem within the code space and demonstrate how optimization-inspired techniques can enhance inference scaling. Based on this perspective, we propose SCATTERED FOREST SEARCH (SFS), a novel approach that improves solution diversity and better exploits feedback during evolutionary search. Our theoretical analysis illustrates how these methods help avoid local optima during optimization, leading to more efficient exploration. Extensive experiments on HumanEval, MBPP, APPS, CodeContests, and Leetcode reveal significant performance gains. For instance, our method achieves a pass@1 rate of 67.1% on HumanEval+ and 87.2% on HumanEval with GPT-3.5, marking improvements of 8.6% and 4.3% over the state-of-the-art, while also halving the iterations needed to find the correct solution. Furthermore, our approach scales more efficiently than existing search techniques, including tree search, line search, and repeated sampling.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes