CLOct 30, 2023

Dynamics of Instruction Fine-Tuning for Chinese Large Language Models

arXiv:2310.19651v321 citationsh-index: 8
Originality Incremental advance
AI Analysis

This work addresses the scaling properties of instruction tuning for Chinese LLMs, which is incremental as it extends known English-focused research to another language.

The study systematically investigates how data quantity, model size, and data construction methods affect instruction tuning for Chinese large language models, finding that different abilities scale differently and can be optimized by tailoring training strategies, leading to enhanced performance on benchmarks.

Instruction tuning is a burgeoning method to elicit the general intelligence of Large Language Models (LLMs). While numerous studies have examined the impact of factors such as data volume and model size on English models, the scaling properties of instruction tuning in other languages remain largely unexplored. In this work, we systematically investigate the effects of data quantity, model size, and data construction methods on instruction tuning for Chinese LLMs. We utilize a newly curated dataset, DoIT, which includes over 40,000 high-quality instruction instances covering ten underlying abilities, such as creative writing, code generation, and logical reasoning. Our experiments, conducted on models ranging from 7b to 33b parameters, yield three key findings: (i) While these factors directly affect overall model performance, some abilities are more responsive to scaling, whereas others demonstrate significant resistance. (ii) The scaling sensitivity of different abilities to these factors can be explained by two features: Complexity and Transference. (iii) By tailoring training strategies to their varying sensitivities, specific abilities can be efficiently learned, enhancing performance on two public benchmarks.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes