CLAIApr 14, 2019

Distributed representation of multi-sense words: A loss-driven approach

arXiv:1904.06725v13 citations
Originality Incremental advance
AI Analysis

This work addresses a specific limitation in natural language processing for representing polysemous words, offering an incremental improvement over current state-of-the-art approaches like Word2Vec.

The paper tackles the problem of representing multi-sense words in distributed representations by proposing LDMI, a loss-driven model that assigns different vectors to each sense, leading to improved performance on contextual word similarity tasks compared to existing methods.

Word2Vec's Skip Gram model is the current state-of-the-art approach for estimating the distributed representation of words. However, it assumes a single vector per word, which is not well-suited for representing words that have multiple senses. This work presents LDMI, a new model for estimating distributional representations of words. LDMI relies on the idea that, if a word carries multiple senses, then having a different representation for each of its senses should lead to a lower loss associated with predicting its co-occurring words, as opposed to the case when a single vector representation is used for all the senses. After identifying the multi-sense words, LDMI clusters the occurrences of these words to assign a sense to each occurrence. Experiments on the contextual word similarity task show that LDMI leads to better performance than competing approaches.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes