Word Embeddings from Large-Scale Greek Web Content
This provides Greek NLP practitioners with essential linguistic resources, but it is incremental as it applies existing methods to new data.
The authors tackled the lack of large-scale Greek word embeddings by training them on the largest digital Greek corpus to date, resulting in a live web tool for testing these embeddings with functions like analogy and similarity scoring.
Word embeddings are undoubtedly very useful components in many NLP tasks. In this paper, we present word embeddings and other linguistic resources trained on the largest to date digital Greek language corpus. We also present a live web tool for testing the Greek word embeddings, by offering "analogy", "similarity score" and "most similar words" functions. Through our explorer, one could interact with the Greek word vectors.