CLDec 3, 2023

Bridging Background Knowledge Gaps in Translation with Automatic Explicitation

arXiv:2312.01308v1132 citationsh-index: 36EMNLP
Originality Incremental advance
AI Analysis

This work addresses a specific issue in NLP for translation by providing a method to enhance understanding for users with cultural or knowledge gaps, though it is incremental as it builds on existing translation and dataset collection approaches.

The paper tackled the problem of translations being incomprehensible due to background knowledge gaps by introducing automatic techniques for generating explicitations, using a new dataset called WikiExpl, and showed that these explicitations improve question answering accuracy in a multilingual framework.

Translations help people understand content written in another language. However, even correct literal translations do not fulfill that goal when people lack the necessary background to understand them. Professional translators incorporate explicitations to explain the missing context by considering cultural differences between source and target audiences. Despite its potential to help users, NLP research on explicitation is limited because of the dearth of adequate evaluation methods. This work introduces techniques for automatically generating explicitations, motivated by WikiExpl: a dataset that we collect from Wikipedia and annotate with human translators. The resulting explicitations are useful as they help answer questions more accurately in a multilingual question answering framework.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes