CLNov 5, 2020

Learning Efficient Task-Specific Meta-Embeddings with Word Prisms

Jingyi He, KC Tsiolis, Kian Kenyon-Dean, Jackie Chi Kit Cheung

arXiv:2011.02944v131.0990 citationsHas Code

Originality Incremental advance

AI Analysis

This work addresses the need for efficient and effective meta-embeddings in NLP, though it is incremental as it builds on existing meta-embedding methods.

The paper tackles the problem of combining multiple word embeddings for NLP tasks by introducing word prisms, a method that learns task-specific linear combinations, resulting in improved performance across all six evaluated tasks.

Word embeddings are trained to predict word cooccurrence statistics, which leads them to possess different lexical properties (syntactic, semantic, etc.) depending on the notion of context defined at training time. These properties manifest when querying the embedding space for the most similar vectors, and when used at the input layer of deep neural networks trained to solve downstream NLP problems. Meta-embeddings combine multiple sets of differently trained word embeddings, and have been shown to successfully improve intrinsic and extrinsic performance over equivalent models which use just one set of source embeddings. We introduce word prisms: a simple and efficient meta-embedding method that learns to combine source embeddings according to the task at hand. Word prisms learn orthogonal transformations to linearly combine the input source embeddings, which allows them to be very efficient at inference time. We evaluate word prisms in comparison to other meta-embedding methods on six extrinsic evaluations and observe that word prisms offer improvements in performance on all tasks.

View on arXiv PDF Code

Similar