CL | AI | LG | May 14, 2025

System Prompt Optimization with Meta-Learning

arXiv:2505.09666v2 | 14 citations | h-index: 20
Originality: Highly original
AI Analysis

This paper addresses a gap in LLM prompt optimization by focusing on system prompts, which are task-agnostic and reusable. The result could benefit developers and researchers who want efficient, generalizable ways to adapt LLM behavior.

The paper tackles the problem of optimizing system prompts for Large Language Models (LLMs), which have been overlooked in favor of task-specific user prompts, by proposing a bilevel meta-learning framework that learns system prompts robust to diverse user prompts and transferable to unseen tasks. The approach shows effective generalization across 14 unseen datasets spanning 5 domains, enabling rapid adaptation with fewer optimization steps and improved performance.

Large Language Models (LLMs) have shown remarkable capabilities, with optimizing their input prompts playing a pivotal role in maximizing their performance. However, while LLM prompts consist of both the task-agnostic system prompts and task-specific user prompts, existing work on prompt optimization has focused on user prompts specific to individual queries or tasks, and largely overlooked the system prompt that is, once optimized, applicable across different tasks and domains. Motivated by this, we introduce the novel problem of bilevel system prompt optimization, whose objective is to design system prompts that are robust to diverse user prompts and transferable to unseen tasks. To tackle this problem, we then propose a meta-learning framework, which meta-learns the system prompt by optimizing it over various user prompts across multiple datasets, while simultaneously updating the user prompts in an iterative manner to ensure synergy between them. We conduct experiments on 14 unseen datasets spanning 5 different domains, on which we show that our approach produces system prompts that generalize effectively to diverse user prompts. Also, our findings reveal that the optimized system prompt enables rapid adaptation even to unseen tasks, requiring fewer optimization steps for test-time user prompts while achieving improved performance.
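The alternating structure described in the abstract (user prompts updated per task, system prompt updated across tasks) can be illustrated with a toy sketch. This is not the authors' implementation: the names `optimize`, `best_user_prompt`, `meta_objective`, and `toy_score` are hypothetical, candidate prompts come from a fixed hand-written pool rather than being generated by an LLM, and the score is a keyword-matching stand-in for real task evaluation.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical scorer signature: (system_prompt, user_prompt, task) -> score
Scorer = Callable[[str, str, str], float]


def best_user_prompt(system: str, task: str, candidates: List[str], score: Scorer) -> str:
    """Inner step: with the system prompt fixed, pick the best user prompt for one task."""
    return max(candidates, key=lambda user: score(system, user, task))


def meta_objective(system: str, user_prompts: Dict[str, str], score: Scorer) -> float:
    """Outer objective: mean score of a system prompt over all tasks and their user prompts."""
    total = sum(score(system, user, task) for task, user in user_prompts.items())
    return total / len(user_prompts)


def optimize(
    system_candidates: List[str],
    user_candidates: Dict[str, List[str]],
    score: Scorer,
    steps: int = 3,
) -> Tuple[str, Dict[str, str]]:
    """Alternate between per-task user prompt updates and a cross-task system prompt update."""
    system = system_candidates[0]
    user_prompts = {task: pool[0] for task, pool in user_candidates.items()}
    for _ in range(steps):
        # Inner loop: adapt each task's user prompt to the current system prompt.
        user_prompts = {
            task: best_user_prompt(system, task, pool, score)
            for task, pool in user_candidates.items()
        }
        # Outer loop: choose the system prompt that works best across all tasks.
        system = max(
            system_candidates,
            key=lambda s: meta_objective(s, user_prompts, score),
        )
    return system, user_prompts


def toy_score(system: str, user: str, task: str) -> float:
    """Stand-in for task evaluation (assumption): rewards two keyword matches."""
    points = 0.0
    if "step by step" in system:
        points += 1.0
    if task in user:
        points += 1.0
    return points


system_pool = [
    "You are a helpful assistant.",
    "You are a helpful assistant. Reason step by step.",
]
user_pool = {
    "math": ["Answer the question.", "Solve this math problem."],
    "law": ["Answer the question.", "Answer this law question."],
}

system, users = optimize(system_pool, user_pool, toy_score)
print(system)
print(users)
```

In a realistic setting, the fixed pools would be replaced by candidate prompts proposed during optimization, and `toy_score` by accuracy on held-out examples from each dataset; the alternating loop itself would stay the same.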

Code Implementations: 1 repo
