CL AI LGJul 2, 2024

RLHF Can Speak Many Languages: Unlocking Multilingual Preference Optimization for LLMs

John Dang, Arash Ahmadian, Kelly Marchisio, Julia Kreutzer, Ahmet Üstün, Sara Hooker

arXiv:2407.02552v120.949 citationsh-index: 18

Originality Highly original

AI Analysis

This work expands alignment techniques to 23 languages covering half of the world's population, addressing a gap in multilingual LLM optimization.

The paper tackled the problem of aligning multilingual large language models (LLMs) with preference optimization, which had previously focused on a few languages like English and Chinese, by introducing a scalable method for generating high-quality multilingual feedback data and demonstrating cross-lingual transfer benefits, resulting in a model that achieved a 54.4% win-rate against the state-of-the-art multilingual LLM Aya 23 8B and 69.5% or higher win-rates against other widely used models.

Preference optimization techniques have become a standard final stage for training state-of-art large language models (LLMs). However, despite widespread adoption, the vast majority of work to-date has focused on first-class citizen languages like English and Chinese. This captures a small fraction of the languages in the world, but also makes it unclear which aspects of current state-of-the-art research transfer to a multilingual setting. In this work, we perform an exhaustive study to achieve a new state-of-the-art in aligning multilingual LLMs. We introduce a novel, scalable method for generating high-quality multilingual feedback data to balance data coverage. We establish the benefits of cross-lingual transfer and increased dataset size in preference training. Our preference-trained model achieves a 54.4% win-rate against Aya 23 8B, the current state-of-the-art multilingual LLM in its parameter class, and a 69.5% win-rate or higher against widely used models like Gemma-1.1-7B-it, Llama-3-8B-Instruct, Mistral-7B-Instruct-v0.3. As a result of our study, we expand the frontier of alignment techniques to 23 languages covering half of the world's population.

View on arXiv PDF

Similar