CLSep 8, 2018

Exploration on Grounded Word Embedding: Matching Words and Images with Image-Enhanced Skip-Gram Model

arXiv:1809.02765v10.2

Originality Incremental advance

AI Analysis

This work addresses the interpretability issue in word embeddings for NLP researchers, though it is incremental as it builds on existing skip-gram models.

The paper tackled the problem of making word embeddings more interpretable by aligning them with image vectors, resulting in highly correlated embeddings that provide image-based explanations.

Word embedding is designed to represent the semantic meaning of a word with low dimensional vectors. The state-of-the-art methods of learning word embeddings (word2vec and GloVe) only use the word co-occurrence information. The learned embeddings are real number vectors, which are obscure to human. In this paper, we propose an Image-Enhanced Skip-Gram Model to learn grounded word embeddings by representing the word vectors in the same hyper-plane with image vectors. Experiments show that the image vectors and word embeddings learned by our model are highly correlated, which indicates that our model is able to provide a vivid image-based explanation to the word embeddings.

View on arXiv PDF

Similar