CLOct 9, 2022

Cross-Align: Modeling Deep Cross-lingual Interactions for Word Alignment

Tsinghua
arXiv:2210.04141v1292 citationsh-index: 49
Originality Incremental advance
AI Analysis

This work improves word alignment, a fundamental tool for natural language processing, by enhancing cross-lingual modeling, but it is incremental as it builds on pre-trained multilingual models.

The paper tackled the problem of word alignment by addressing the lack of deep cross-lingual interactions in existing methods, which degrade alignment quality for ambiguous words, and proposed Cross-Align to model these interactions, achieving state-of-the-art performance on four out of five language pairs.

Word alignment which aims to extract lexicon translation equivalents between source and target sentences, serves as a fundamental tool for natural language processing. Recent studies in this area have yielded substantial improvements by generating alignments from contextualized embeddings of the pre-trained multilingual language models. However, we find that the existing approaches capture few interactions between the input sentence pairs, which degrades the word alignment quality severely, especially for the ambiguous words in the monolingual context. To remedy this problem, we propose Cross-Align to model deep interactions between the input sentence pairs, in which the source and target sentences are encoded separately with the shared self-attention modules in the shallow layers, while cross-lingual interactions are explicitly constructed by the cross-attention modules in the upper layers. Besides, to train our model effectively, we propose a two-stage training framework, where the model is trained with a simple Translation Language Modeling (TLM) objective in the first stage and then finetuned with a self-supervised alignment objective in the second stage. Experiments show that the proposed Cross-Align achieves the state-of-the-art (SOTA) performance on four out of five language pairs.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes