CLJan 21, 2018

A Survey of Word Embeddings Evaluation Methods

arXiv:1801.09536v1194 citations
Originality Synthesis-oriented
AI Analysis

It provides a systematic overview for researchers and practitioners working with word embeddings, but it is incremental as it compiles existing methods without introducing new ones.

This paper surveys evaluation methods for word embeddings, summarizing 16 intrinsic and 12 extrinsic approaches to address the open problem of determining the most adequate evaluation techniques in the field.

Word embeddings are real-valued word representations able to capture lexical semantics and trained on natural language corpora. Models proposing these representations have gained popularity in the recent years, but the issue of the most adequate evaluation method still remains open. This paper presents an extensive overview of the field of word embeddings evaluation, highlighting main problems and proposing a typology of approaches to evaluation, summarizing 16 intrinsic methods and 12 extrinsic methods. I describe both widely-used and experimental methods, systematize information about evaluation datasets and discuss some key challenges.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes