CLJun 28, 2025

The Translation Barrier Hypothesis: Multilingual Generation with Large Language Models Suffers from Implicit Translation Failure

Niyati Bafna, Tianjian Li, Kenton Murray, David R. Mortensen, David Yarowsky, Hale Sirin, Daniel Khashabi

arXiv:2506.22724v215.58 citationsh-index: 7IJCNLP-AACL

Originality Incremental advance

AI Analysis

This addresses a bottleneck in multilingual AI applications, particularly for low-resource languages, but is incremental as it formalizes and quantifies an existing hypothesis.

The paper tackled the problem of poor multilingual generation quality in large language models for mid- to low-resource languages by identifying an implicit task-solving to translation pipeline, and found that translation failure explains a dominant portion of errors, especially for low-resource languages, across 108 language pairs.

Multilingual generation with large language models (LLMs) is often of poor quality for mid- to low-resource languages, but the causes for this are not well-understood. We first demonstrate the existence of an implicit task-solving-->translation pipeline for generation, whereby the model first solves the required task in a largely target-language-agnostic manner, and subsequently translates answer concepts into the intended target language. We hypothesize that the failure of the translation stage, despite task-solving success, is an important culprit for the observed low quality of final outputs, and formalize this as the translation barrier hypothesis. We quantify the extent to which either stage in the pipeline is responsible for final failure for a word translation task across 108 language pairs, and find that the translation barrier explains a dominant portion of error for a majority of language pairs, and is especially severe for low-resource target languages. Our results highlight an important bottleneck for end-to-end multilingual generation, relevant for future work seeking to improve multilinguality in LLMs.

View on arXiv PDF

Similar