CLOct 8, 2018

Word Embeddings from Large-Scale Greek Web Content

arXiv:1810.06694v313 citations
Originality Synthesis-oriented
AI Analysis

This provides Greek NLP practitioners with essential linguistic resources, but it is incremental as it applies existing methods to new data.

The authors tackled the lack of large-scale Greek word embeddings by training them on the largest digital Greek corpus to date, resulting in a live web tool for testing these embeddings with functions like analogy and similarity scoring.

Word embeddings are undoubtedly very useful components in many NLP tasks. In this paper, we present word embeddings and other linguistic resources trained on the largest to date digital Greek language corpus. We also present a live web tool for testing the Greek word embeddings, by offering "analogy", "similarity score" and "most similar words" functions. Through our explorer, one could interact with the Greek word vectors.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes