CLMay 12, 2016

On the Convergent Properties of Word Embedding Methods

arXiv:1605.03956v11 citations
Originality Synthesis-oriented
AI Analysis

This work addresses the need for robust evaluation metrics in natural language processing, particularly for researchers developing word embedding methods, though it appears incremental in scope.

The paper tackles the problem of evaluating the consistency and reliability of word embedding methods across different random initializations, proposing a new metric that measures this property and shows correlation with downstream task performance.

Do word embeddings converge to learn similar things over different initializations? How repeatable are experiments with word embeddings? Are all word embedding techniques equally reliable? In this paper we propose evaluating methods for learning word representations by their consistency across initializations. We propose a measure to quantify the similarity of the learned word representations under this setting (where they are subject to different random initializations). Our preliminary results illustrate that our metric not only measures a intrinsic property of word embedding methods but also correlates well with other evaluation metrics on downstream tasks. We believe our methods are is useful in characterizing robustness -- an important property to consider when developing new word embedding methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes