CLLGFeb 10, 2020

Multilingual Alignment of Contextual Word Representations

arXiv:2002.03518v2211 citations
AI Analysis

This work addresses the challenge of improving cross-lingual transfer in multilingual models for NLP applications, representing an incremental advancement with specific performance gains.

The authors tackled the problem of multilingual contextual embedding alignment in BERT by proposing evaluation procedures and an alignment method, resulting in significantly improved zero-shot performance on XNLI that matched pseudo-fully-supervised models for Bulgarian and Greek.

We propose procedures for evaluating and strengthening contextual embedding alignment and show that they are useful in analyzing and improving multilingual BERT. In particular, after our proposed alignment procedure, BERT exhibits significantly improved zero-shot performance on XNLI compared to the base model, remarkably matching pseudo-fully-supervised translate-train models for Bulgarian and Greek. Further, to measure the degree of alignment, we introduce a contextual version of word retrieval and show that it correlates well with downstream zero-shot transfer. Using this word retrieval task, we also analyze BERT and find that it exhibits systematic deficiencies, e.g. worse alignment for open-class parts-of-speech and word pairs written in different scripts, that are corrected by the alignment procedure. These results support contextual alignment as a useful concept for understanding large multilingual pre-trained models.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes