MLAICLLGFeb 15, 2024

Efficient Prompt Optimization Through the Lens of Best Arm Identification

arXiv:2402.09723v332 citationsh-index: 13NIPS
Originality Highly original
AI Analysis

This work addresses the cost-effective optimization of prompts for LLM users, offering a novel selection method that is incremental but impactful for practical applications.

The paper tackles the problem of efficiently selecting optimal prompts for large language models under a limited budget by introducing TRIPLE, a framework that connects prompt optimization to best arm identification in multi-armed bandits, resulting in significant performance improvements over baselines in experiments across multiple tasks and models.

The remarkable instruction-following capability of large language models (LLMs) has sparked a growing interest in automatically finding good prompts, i.e., prompt optimization. Most existing works follow the scheme of selecting from a pre-generated pool of candidate prompts. However, these designs mainly focus on the generation strategy, while limited attention has been paid to the selection method. Especially, the cost incurred during the selection (e.g., accessing LLM and evaluating the responses) is rarely explicitly considered. To overcome this limitation, this work provides a principled framework, TRIPLE, to efficiently perform prompt selection under an explicit budget constraint. TRIPLE is built on a novel connection established between prompt optimization and fixed-budget best arm identification (BAI-FB) in multi-armed bandits (MAB); thus, it is capable of leveraging the rich toolbox from BAI-FB systematically and also incorporating unique characteristics of prompt optimization. Extensive experiments on multiple well-adopted tasks using various LLMs demonstrate the remarkable performance improvement of TRIPLE over baselines while satisfying the limited budget constraints. As an extension, variants of TRIPLE are proposed to efficiently select examples for few-shot prompts, also achieving superior empirical performance.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes