CLLGJun 22, 2015

Skip-Thought Vectors

arXiv:1506.06726v12482 citations
Originality Incremental advance
AI Analysis

This provides a robust, generic sentence encoder for natural language processing tasks, though it is incremental in building on existing unsupervised learning approaches.

The authors tackled unsupervised learning of generic sentence representations by training an encoder-decoder model on book text to reconstruct surrounding sentences, resulting in an off-the-shelf encoder that performed well on 8 tasks including semantic relatedness and sentiment analysis.

We describe an approach for unsupervised learning of a generic, distributed sentence encoder. Using the continuity of text from books, we train an encoder-decoder model that tries to reconstruct the surrounding sentences of an encoded passage. Sentences that share semantic and syntactic properties are thus mapped to similar vector representations. We next introduce a simple vocabulary expansion method to encode words that were not seen as part of training, allowing us to expand our vocabulary to a million words. After training our model, we extract and evaluate our vectors with linear models on 8 tasks: semantic relatedness, paraphrase detection, image-sentence ranking, question-type classification and 4 benchmark sentiment and subjectivity datasets. The end result is an off-the-shelf encoder that can produce highly generic sentence representations that are robust and perform well in practice. We will make our encoder publicly available.

Code Implementations16 repos

Data from Papers with Code (CC-BY-SA-4.0)

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes