CLLGMLOct 24, 2016

Geometry of Polysemy

arXiv:1610.07569v121 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of representing multiple meanings of words in NLP, which is crucial for improving language understanding tasks, though it appears incremental as it builds on existing vector-based methods.

The paper tackles the problem of modeling polysemous words in NLP by proposing an unsupervised approach that uses low-rank subspaces for context representations and a clustering algorithm on Grassmannian geometry for sense disambiguation, achieving new state-of-the-art results on standard datasets.

Vector representations of words have heralded a transformational approach to classical problems in NLP; the most popular example is word2vec. However, a single vector does not suffice to model the polysemous nature of many (frequent) words, i.e., words with multiple meanings. In this paper, we propose a three-fold approach for unsupervised polysemy modeling: (a) context representations, (b) sense induction and disambiguation and (c) lexeme (as a word and sense pair) representations. A key feature of our work is the finding that a sentence containing a target word is well represented by a low rank subspace, instead of a point in a vector space. We then show that the subspaces associated with a particular sense of the target word tend to intersect over a line (one-dimensional subspace), which we use to disambiguate senses using a clustering algorithm that harnesses the Grassmannian geometry of the representations. The disambiguation algorithm, which we call $K$-Grassmeans, leads to a procedure to label the different senses of the target word in the corpus -- yielding lexeme vector representations, all in an unsupervised manner starting from a large (Wikipedia) corpus in English. Apart from several prototypical target (word,sense) examples and a host of empirical studies to intuit and justify the various geometric representations, we validate our algorithms on standard sense induction and disambiguation datasets and present new state-of-the-art results.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes