CLJun 2, 2021

Evaluating Word Embeddings with Categorical Modularity

Sílvia Casacuberta, Karina Halevy, Damián E. Blasi

arXiv:2106.00877v131.4711 citationsHas Code

Originality Incremental advance

AI Analysis

This work addresses the need for efficient evaluation metrics in natural language processing, particularly for low-resource settings, though it is incremental as it builds on existing embedding models and tasks.

The authors tackled the problem of evaluating word embedding quality by introducing categorical modularity, a low-resource intrinsic metric, and found moderate to strong positive correlations with downstream tasks like sentiment analysis and bilingual lexicon induction across multiple languages and models.

We introduce categorical modularity, a novel low-resource intrinsic metric to evaluate word embedding quality. Categorical modularity is a graph modularity metric based on the $k$-nearest neighbor graph constructed with embedding vectors of words from a fixed set of semantic categories, in which the goal is to measure the proportion of words that have nearest neighbors within the same categories. We use a core set of 500 words belonging to 59 neurobiologically motivated semantic categories in 29 languages and analyze three word embedding models per language (FastText, MUSE, and subs2vec). We find moderate to strong positive correlations between categorical modularity and performance on the monolingual tasks of sentiment analysis and word similarity calculation and on the cross-lingual task of bilingual lexicon induction both to and from English. Overall, we suggest that categorical modularity provides non-trivial predictive information about downstream task performance, with breakdowns of correlations by model suggesting some meta-predictive properties about semantic information loss as well.

View on arXiv PDF Code

Similar