CLNov 17, 2023

Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2

AI2UW
arXiv:2311.10702v2256 citationsh-index: 28
Originality Synthesis-oriented
AI Analysis

This work provides enhanced open resources for instruction tuning, facilitating future efforts in adapting large language models, though it is incremental in building upon existing TÜLU methods.

The authors tackled the problem of adapting pretrained language models to downstream tasks by developing TÜLU 2, a suite of improved models that achieve state-of-the-art performance among open models and match or exceed GPT-3.5-turbo-0301 on several benchmarks.

Since the release of TÜLU [Wang et al., 2023b], open resources for instruction tuning have developed quickly, from better base models to new finetuning techniques. We test and incorporate a number of these advances into TÜLU, resulting in TÜLU 2, a suite of improved TÜLU models for advancing the understanding and best practices of adapting pretrained language models to downstream tasks and user preferences. Concretely, we release: (1) TÜLU-V2-mix, an improved collection of high-quality instruction datasets; (2) TÜLU 2, LLAMA-2 models finetuned on the V2 mixture; (3) TÜLU 2+DPO, TÜLU 2 models trained with direct preference optimization (DPO), including the largest DPO-trained model to date (TÜLU 2+DPO 70B); (4) CODE TÜLU 2, CODE LLAMA models finetuned on our V2 mix that outperform CODE LLAMA and its instruction-tuned variant, CODE LLAMA-Instruct. Our evaluation from multiple perspectives shows that the TÜLU 2 suite achieves state-of-the-art performance among open models and matches or exceeds the performance of GPT-3.5-turbo-0301 on several benchmarks. We release all the checkpoints, data, training and evaluation code to facilitate future open efforts on adapting large language models.

Code Implementations3 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes