CL · Feb 17, 2025

InsBank: Evolving Instruction Subset for Ongoing Alignment

arXiv:2502.11419v2 · 1 citation · h-index: 11 · EMNLP
Originality: Incremental advance
AI Analysis

This work addresses the need for cost-effective and adaptive instruction tuning in LLMs, though it appears incremental as it builds on existing subset selection methods.

The paper tackles the problem of evolving instruction subsets for the ongoing alignment of large language models. It introduces InsBank and the PIBE framework, which the authors report significantly outperforms baselines in InsBank evolution while maintaining long-term efficiency and accounting for diversity.

Large language models (LLMs) typically undergo instruction tuning to enhance alignment. Recent studies emphasize that quality and diversity of instruction data are more crucial than quantity, highlighting the need to select diverse, high-quality subsets to reduce training costs. However, how to evolve these selected subsets alongside the development of new instruction data remains insufficiently explored. To achieve LLMs' ongoing alignment, we introduce Instruction Bank (InsBank), a continuously updated repository that integrates the latest valuable instruction data. We further propose Progressive Instruction Bank Evolution (PIBE), a novel framework designed to evolve InsBank effectively and efficiently over time. PIBE employs a gradual data selection strategy to maintain long-term efficiency, leveraging a representation-based diversity score to capture relationships between data points and retain historical information for comprehensive diversity evaluation. This also allows for flexible combination of diversity and quality scores during data selection and ranking. Extensive experiments demonstrate that PIBE significantly outperforms baselines in InsBank evolution and is able to extract budget-specific subsets, demonstrating its effectiveness and adaptability.
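To make the idea of combining quality and diversity scores during selection and ranking more concrete, here is a minimal sketch. It is not the PIBE algorithm, whose details are not given in the abstract. It swaps in a simple greedy, maximal-marginal-relevance-style selection as a stand-in. The names `rank_items` and `evolve_bank`, the weight `alpha`, the cosine-based diversity score, and the `(id, embedding, quality)` item layout are all assumptions made for illustration.

```python
import math
from typing import List, Sequence, Tuple

# Hypothetical item layout: (id, embedding, quality score in [0, 1]).
Item = Tuple[str, Sequence[float], float]


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity between two vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def rank_items(pool: List[Item], capacity: int, alpha: float = 0.5) -> List[Item]:
    """Greedily rank up to `capacity` items from `pool`.

    Each step picks the item maximizing
        alpha * quality + (1 - alpha) * diversity,
    where diversity = 1 - (max cosine similarity to already-picked items).
    This is an illustrative stand-in, not the paper's scoring rule.
    """
    remaining = list(pool)
    ranked: List[Item] = []
    while remaining and len(ranked) < capacity:
        def score(item: Item) -> float:
            _, emb, quality = item
            max_sim = max((cosine(emb, s[1]) for s in ranked), default=0.0)
            return alpha * quality + (1.0 - alpha) * (1.0 - max_sim)

        best = max(remaining, key=score)
        ranked.append(best)
        remaining.remove(best)
    return ranked


def evolve_bank(bank: List[Item], new_data: List[Item],
                capacity: int, alpha: float = 0.5) -> List[Item]:
    """Merge the current bank with newly arrived data and re-select."""
    return rank_items(bank + new_data, capacity, alpha)
```

Because the selection is greedy and deterministic, the first k entries of the returned ranking are the same as the result of calling it with capacity k, which is one simple way to read a budget-specific subset off a single ranked bank. Note that this sketch re-scores the whole pool on every update, so it does not reproduce the gradual, long-term efficiency that the abstract attributes to PIBE.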

Code Implementations: 1 repo
