CLAILGJul 2, 2024

RLHF Can Speak Many Languages: Unlocking Multilingual Preference Optimization for LLMs

arXiv:2407.02552v147 citationsh-index: 18
Originality Highly original
AI Analysis

This work expands alignment techniques to 23 languages covering half of the world's population, addressing a gap in multilingual LLM optimization.

The paper tackled the problem of aligning multilingual large language models (LLMs) with preference optimization, which had previously focused on a few languages like English and Chinese, by introducing a scalable method for generating high-quality multilingual feedback data and demonstrating cross-lingual transfer benefits, resulting in a model that achieved a 54.4% win-rate against the state-of-the-art multilingual LLM Aya 23 8B and 69.5% or higher win-rates against other widely used models.

Preference optimization techniques have become a standard final stage for training state-of-art large language models (LLMs). However, despite widespread adoption, the vast majority of work to-date has focused on first-class citizen languages like English and Chinese. This captures a small fraction of the languages in the world, but also makes it unclear which aspects of current state-of-the-art research transfer to a multilingual setting. In this work, we perform an exhaustive study to achieve a new state-of-the-art in aligning multilingual LLMs. We introduce a novel, scalable method for generating high-quality multilingual feedback data to balance data coverage. We establish the benefits of cross-lingual transfer and increased dataset size in preference training. Our preference-trained model achieves a 54.4% win-rate against Aya 23 8B, the current state-of-the-art multilingual LLM in its parameter class, and a 69.5% win-rate or higher against widely used models like Gemma-1.1-7B-it, Llama-3-8B-Instruct, Mistral-7B-Instruct-v0.3. As a result of our study, we expand the frontier of alignment techniques to 23 languages covering half of the world's population.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes