CLMar 4, 2018

Concatenated Power Mean Word Embeddings as Universal Cross-Lingual Sentence Representations

arXiv:1803.01400v2102 citations
Originality Incremental advance
AI Analysis

This provides a more effective and harder-to-beat baseline for cross-lingual and monolingual sentence representation tasks, benefiting NLP researchers and practitioners.

The authors tackled the problem of improving sentence embeddings by generalizing average word embeddings to power mean word embeddings, showing that concatenating different types substantially outperforms state-of-the-art methods cross-lingually and closes the gap monolingually with solid margins over baselines like SIF and Sent2Vec.

Average word embeddings are a common baseline for more sophisticated sentence embedding techniques. However, they typically fall short of the performances of more complex models such as InferSent. Here, we generalize the concept of average word embeddings to power mean word embeddings. We show that the concatenation of different types of power mean word embeddings considerably closes the gap to state-of-the-art methods monolingually and substantially outperforms these more complex techniques cross-lingually. In addition, our proposed method outperforms different recently proposed baselines such as SIF and Sent2Vec by a solid margin, thus constituting a much harder-to-beat monolingual baseline. Our data and code are publicly available.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes