CLAISep 16, 2023

Monolingual or Multilingual Instruction Tuning: Which Makes a Better Alpaca

arXiv:2309.08958v2138 citationsh-index: 45
Originality Incremental advance
AI Analysis

This provides a cost-efficient guide for expanding language support in AI applications like chat assistants, though it is incremental in nature.

The study compared monolingual and multilingual instruction tuning for large language models, finding that multilingual tuning performs as well or better than per-language tuning under a fixed computation budget, with downsampled data also proving effective and robust.

Foundational large language models (LLMs) can be instruction-tuned to perform open-domain question answering, facilitating applications like chat assistants. While such efforts are often carried out in a single language, we empirically analyze cost-efficient strategies for multilingual scenarios. Our study employs the Alpaca dataset and machine translations of it to form multilingual data, which is then used to tune LLMs through either low-rank adaptation or full-parameter training. Under a controlled computation budget, comparisons show that multilingual tuning is on par or better than tuning a model for each language. Furthermore, multilingual tuning with downsampled data can be as powerful and more robust. Our findings serve as a guide for expanding language support through instruction tuning.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes