CLLGDec 20, 2014

Outperforming Word2Vec on Analogy Tasks with Random Projections

arXiv:1412.6616v24 citations
Originality Incremental advance
AI Analysis

This provides a simpler alternative to neural network methods for natural language processing tasks, though it appears incremental as it builds on existing BEAGLE concepts.

The authors tackled the problem of creating distributed vector representations for words without complex training, achieving state-of-the-art results on analogy tasks, often outperforming Word2Vec.

We present a distributed vector representation based on a simplification of the BEAGLE system, designed in the context of the Sigma cognitive architecture. Our method does not require gradient-based training of neural networks, matrix decompositions as with LSA, or convolutions as with BEAGLE. All that is involved is a sum of random vectors and their pointwise products. Despite the simplicity of this technique, it gives state-of-the-art results on analogy problems, in most cases better than Word2Vec. To explain this success, we interpret it as a dimension reduction via random projection.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes