CLAISep 12, 2019

Retrofitting Contextualized Word Embeddings with Paraphrases

arXiv:1909.09700v11004 citations
Originality Incremental advance
AI Analysis

This addresses robustness issues in NLP models for tasks like classification and inference, but it is incremental as it builds on existing embedding methods.

The paper tackled the problem of contextualized word embeddings being unstable under paraphrasing, which reduces robustness in downstream tasks, and proposed a retrofitting method that significantly improved performance over the original ELMo on sentence classification and language inference tasks.

Contextualized word embedding models, such as ELMo, generate meaningful representations of words and their context. These models have been shown to have a great impact on downstream applications. However, in many cases, the contextualized embedding of a word changes drastically when the context is paraphrased. As a result, the downstream model is not robust to paraphrasing and other linguistic variations. To enhance the stability of contextualized word embedding models, we propose an approach to retrofitting contextualized embedding models with paraphrase contexts. Our method learns an orthogonal transformation on the input space, which seeks to minimize the variance of word representations on paraphrased contexts. Experiments show that the retrofitted model significantly outperforms the original ELMo on various sentence classification and language inference tasks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes