AIJun 18, 2024

Refine Large Language Model Fine-tuning via Instruction Vector

arXiv:2406.12227v36 citations
Originality Incremental advance
AI Analysis

This work addresses catastrophic forgetting in LLMs, which is a domain-specific problem for AI researchers and practitioners, though it is incremental as it builds on existing fine-tuning methods.

The paper tackles the problem of catastrophic forgetting in fine-tuned large language models by identifying instruction following as a key contributor and proposing the Instruction Vector framework to analyze and mitigate it, with empirical tests on three benchmarks confirming efficacy.

Fine-tuning large language models (LLMs) can cause them to lose their general capabilities. However, the intrinsic mechanisms behind such forgetting remain unexplored. In this paper, we begin by examining this phenomenon by focusing on knowledge understanding and instruction following, with the latter identified as the main contributor to forgetting during fine-tuning. Consequently, we propose the Instruction Vector (IV) framework to capture model representations highly related to specific instruction-following capabilities, thereby making it possible to understand model-intrinsic forgetting. Through the analysis of IV dynamics pre and post-training, we suggest that fine-tuning mostly adds specialized reasoning patterns instead of erasing previous skills, which may appear as forgetting. Building on this insight, we develop IV-guided training, which aims to preserve original computation graph, thereby mitigating catastrophic forgetting. Empirical tests on three benchmarks confirm the efficacy of this new approach, supporting the relationship between IVs and forgetting. Our code will be made available soon.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes