CL LGFeb 15, 2023

Dictionary-based Phrase-level Prompting of Large Language Models for Machine Translation

Marjan Ghazvininejad, Hila Gonen, Luke Zettlemoyer

arXiv:2302.07856v116.2110 citationsh-index: 116

Originality Incremental advance

AI Analysis

This addresses translation challenges for low-resource or domain-specific applications, though it is an incremental improvement over existing prompting methods.

The paper tackles the problem of rare words in machine translation with large language models by using bilingual dictionaries to provide phrase-level control hints in prompts, resulting in DiPMT outperforming baselines in low-resource and out-of-domain scenarios.

Large language models (LLMs) demonstrate remarkable machine translation (MT) abilities via prompting, even though they were not explicitly trained for this task. However, even given the incredible quantities of data they are trained on, LLMs can struggle to translate inputs with rare words, which are common in low resource or domain transfer scenarios. We show that LLM prompting can provide an effective solution for rare words as well, by using prior knowledge from bilingual dictionaries to provide control hints in the prompts. We propose a novel method, DiPMT, that provides a set of possible translations for a subset of the input words, thereby enabling fine-grained phrase-level prompted control of the LLM. Extensive experiments show that DiPMT outperforms the baseline both in low-resource MT, as well as for out-of-domain MT. We further provide a qualitative analysis of the benefits and limitations of this approach, including the overall level of controllability that is achieved.

View on arXiv PDF

Similar