CLAIJun 3, 2018

Learning Semantic Sentence Embeddings using Sequential Pair-wise Discriminator

arXiv:1806.00807v51107 citations
Originality Incremental advance
AI Analysis

This addresses the need for effective sentence-level embeddings in NLP, offering a novel approach that improves over existing methods, though it is incremental in building on encoder-decoder models.

The paper tackles the problem of generating semantic sentence embeddings by proposing a sequential pair-wise discriminator method that ensures paraphrase embeddings are close and unrelated ones are far, resulting in state-of-the-art performance on paraphrase generation and sentiment analysis tasks with statistically significant results.

In this paper, we propose a method for obtaining sentence-level embeddings. While the problem of securing word-level embeddings is very well studied, we propose a novel method for obtaining sentence-level embeddings. This is obtained by a simple method in the context of solving the paraphrase generation task. If we use a sequential encoder-decoder model for generating paraphrase, we would like the generated paraphrase to be semantically close to the original sentence. One way to ensure this is by adding constraints for true paraphrase embeddings to be close and unrelated paraphrase candidate sentence embeddings to be far. This is ensured by using a sequential pair-wise discriminator that shares weights with the encoder that is trained with a suitable loss function. Our loss function penalizes paraphrase sentence embedding distances from being too large. This loss is used in combination with a sequential encoder-decoder network. We also validated our method by evaluating the obtained embeddings for a sentiment analysis task. The proposed method results in semantic embeddings and outperforms the state-of-the-art on the paraphrase generation and sentiment analysis task on standard datasets. These results are also shown to be statistically significant.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes