CLApr 20, 2025

Trans-Zero: Self-Play Incentivizes Large Language Models for Multilingual Translation Without Parallel Data

Wei Zou, Sen Yang, Yu Bao, Shujian Huang, Jiajun Chen, Shanbo Cheng

arXiv:2504.14669v25 citationsh-index: 13ACL

Originality Incremental advance

AI Analysis

This addresses data scarcity and catastrophic forgetting in multilingual translation for low-resource languages, though it is incremental as it builds on existing self-play and optimization techniques.

The paper tackles the problem of multilingual machine translation without parallel data by proposing TRANS-ZERO, a self-play framework using monolingual data and LLM knowledge, which achieves translation performance rivaling supervised methods and excels in non-English directions.

The rise of Large Language Models (LLMs) has reshaped machine translation (MT), but multilingual MT still relies heavily on parallel data for supervised fine-tuning (SFT), facing challenges like data scarcity for low-resource languages and catastrophic forgetting. To address these issues, we propose TRANS-ZERO, a self-play framework that leverages only monolingual data and the intrinsic multilingual knowledge of LLM. TRANS-ZERO combines Genetic Monte-Carlo Tree Search (G-MCTS) with preference optimization, achieving strong translation performance that rivals supervised methods. Experiments demonstrate that this approach not only matches the performance of models trained on large-scale parallel data but also excels in non-English translation directions. Further analysis reveals that G-MCTS itself significantly enhances translation quality by exploring semantically consistent candidates through iterative translations, providing a robust foundation for the framework's succuss.

View on arXiv PDF

Similar