CLDec 5, 2024

Evolutionary Pre-Prompt Optimization for Mathematical Reasoning

arXiv:2412.04291v13 citationsh-index: 5
Originality Incremental advance
AI Analysis

This work addresses the challenge of enhancing mathematical reasoning in LLMs through better prompt design, representing an incremental improvement in few-shot learning methods.

The paper tackles the problem of optimizing example selection for chain-of-thought pre-prompts in large language models, achieving over 10 absolute points improvement in exact match scores on GSM8k and MathQA benchmarks.

Recent advancements have highlighted that large language models (LLMs), when given a small set of task-specific examples, demonstrate remarkable proficiency, a capability that extends to complex reasoning tasks. In particular, the combination of few-shot learning with the chain-of-thought (CoT) approach has been pivotal in steering models towards more logically consistent conclusions. This paper explores the optimization of example selection for designing effective CoT pre-prompts and shows that the choice of the optimization algorithm, typically in favor of comparison-based methods such as evolutionary computation, significantly enhances efficacy and feasibility. Specifically, thanks to a limited exploitative and overfitted optimization, Evolutionary Pre-Prompt Optimization (EPPO) brings an improvement over the naive few-shot approach exceeding 10 absolute points in exact match scores on benchmark datasets such as GSM8k and MathQA. These gains are consistent across various contexts and are further amplified when integrated with self-consistency (SC)

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes