CLAIIRLGOct 30, 2022

Improving Bilingual Lexicon Induction with Cross-Encoder Reranking

CambridgeDeepMind
arXiv:2210.16953v2293 citationsh-index: 56
Originality Incremental advance
AI Analysis

This work addresses a crucial task in multilingual NLP for improving cross-lingual applications, but it is incremental as it builds on existing methods by adding a reranking step.

The paper tackles the problem of bilingual lexicon induction (BLI) with limited supervision by proposing BLICEr, a semi-supervised post-hoc reranking method that combines cross-lingual word embeddings with cross-encoder similarity scores from multilingual pretrained language models, achieving new state-of-the-art results on two standard benchmarks across diverse languages.

Bilingual lexicon induction (BLI) with limited bilingual supervision is a crucial yet challenging task in multilingual NLP. Current state-of-the-art BLI methods rely on the induction of cross-lingual word embeddings (CLWEs) to capture cross-lingual word similarities; such CLWEs are obtained 1) via traditional static models (e.g., VecMap), or 2) by extracting type-level CLWEs from multilingual pretrained language models (mPLMs), or 3) through combining the former two options. In this work, we propose a novel semi-supervised post-hoc reranking method termed BLICEr (BLI with Cross-Encoder Reranking), applicable to any precalculated CLWE space, which improves their BLI capability. The key idea is to 'extract' cross-lingual lexical knowledge from mPLMs, and then combine it with the original CLWEs. This crucial step is done via 1) creating a word similarity dataset, comprising positive word pairs (i.e., true translations) and hard negative pairs induced from the original CLWE space, and then 2) fine-tuning an mPLM (e.g., mBERT or XLM-R) in a cross-encoder manner to predict the similarity scores. At inference, we 3) combine the similarity score from the original CLWE space with the score from the BLI-tuned cross-encoder. BLICEr establishes new state-of-the-art results on two standard BLI benchmarks spanning a wide spectrum of diverse languages: it substantially outperforms a series of strong baselines across the board. We also validate the robustness of BLICEr with different CLWEs.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes