CLAIFeb 17, 2025

System Message Generation for User Preferences using Open-Source Models

arXiv:2502.11330v21 citationsh-index: 6Has Code
Originality Incremental advance
AI Analysis

This addresses the resource-intensive challenge of manually annotating system messages for LLMs, offering a practical solution for industrial applications, though it is incremental as it builds on existing fine-tuning methods.

The paper tackles the problem of generating system messages for large language models to better align assistant responses with user instructions, using existing datasets that lack such messages, and reports substantial improvements in conversation benchmarks, with strong gains in shorter conversations.

System messages play a crucial role in interactions with large language models (LLMs), often serving as prompts to initiate conversations. Through system messages, users can assign specific roles, perform intended tasks, incorporate background information, and specify various output formats and communication styles. Despite such versatility, publicly available datasets often lack system messages and are subject to strict license constraints in industrial applications. Moreover, manually annotating system messages that align with user instructions is resource-intensive. In light of these challenges, we introduce SysGen, a pipeline for generating system messages that better align assistant responses with user instructions using existing supervised fine-tuning datasets that lack system messages. Training open-source models on SysGen data yields substantial improvements in both single-turn (Multifacet) and multi-turn (SysBench) conversation benchmarks. Notably, our method shows strong gains in shorter conversations, suggesting that it enhances early-stage interaction effectiveness. Our qualitative analysis further emphasizes the value of diverse and structured system messages in improving LLM adaptability across varied user scenarios.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes