CL | AI | Jun 11, 2024

Paying More Attention to Source Context: Mitigating Unfaithful Translations from Large Language Model

arXiv:2406.07036v1 | 29 citations
Originality: Incremental advance
AI Analysis

This work addresses unfaithful translations produced by LLMs in machine translation. It is an incremental improvement that mitigates a specific bias in existing models.

The paper tackles unfaithful translations in large language models (LLMs) by addressing their bias towards previously generated tokens over the source context. The proposed methods reduced hallucinatory translations across multiple language pairs, as verified by human evaluation.

Large language models (LLMs) have showcased impressive multilingual machine translation ability. However, unlike encoder-decoder style models, decoder-only LLMs lack an explicit alignment between source and target contexts. Analyzing contribution scores during generation reveals that LLMs can be biased towards previously generated tokens over the corresponding source tokens, leading to unfaithful translations. To address this issue, we propose to encourage LLMs to pay more attention to the source context from both source and target perspectives in zero-shot prompting: 1) adjust source context attention weights; 2) suppress irrelevant target prefix influence. Additionally, we propose 3) avoiding over-reliance on the target prefix in instruction tuning. Experimental results on both human-collected unfaithfulness test sets, which focus on LLM-generated unfaithful translations, and general test sets verify our methods' effectiveness across multiple language pairs. Further human evaluation shows our method's efficacy in reducing hallucinatory translations and facilitating faithful translation generation.
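
To make idea 1 (adjusting source context attention weights) concrete, here is a minimal sketch. It is not the paper's implementation. It assumes a single attention distribution over all context positions, a known set of source-token positions, and a hypothetical scaling factor `alpha`; the function name `boost_source_attention` is invented for this illustration.

```python
def boost_source_attention(weights, source_positions, alpha=2.0):
    """Scale attention on source positions by `alpha`, then renormalize.

    weights: attention probabilities over all context positions (sums to 1).
    source_positions: indices of source-sentence tokens (an assumption here;
        in practice they would come from the prompt layout).
    alpha: hypothetical boost factor; values above 1 favor the source context.
    """
    if alpha <= 0:
        raise ValueError("alpha must be positive")
    source = set(source_positions)
    # Up-weight source positions, leave target-prefix positions unchanged.
    scaled = [w * alpha if i in source else w for i, w in enumerate(weights)]
    total = sum(scaled)
    if total == 0:
        return list(weights)
    # Renormalize so the result is again a probability distribution.
    return [w / total for w in scaled]


# Two source positions (0, 1) followed by two target-prefix positions (2, 3).
attention = [0.1, 0.1, 0.4, 0.4]
adjusted = boost_source_attention(attention, source_positions=[0, 1], alpha=2.0)
```

In this toy case the attention mass on the source positions rises from 0.2 to roughly 0.33, while the target-prefix positions lose a corresponding share.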

Code Implementations: 1 repo
