CLJun 9, 2017

Trimming and Improving Skip-thought Vectors

Shuai Tang, Hailin Jin, Chen Fang, Zhaowen Wang, Virginia R. de Sa

arXiv:1706.03148v11.64 citations

Originality Incremental advance

AI Analysis

This work provides an incremental improvement to sentence representation models for natural language processing tasks.

The authors proposed techniques to trim and improve skip-thought vectors for sentence representation learning, showing that their model is faster, lighter-weight, and equally powerful as the original.

The skip-thought model has been proven to be effective at learning sentence representations and capturing sentence semantics. In this paper, we propose a suite of techniques to trim and improve it. First, we validate a hypothesis that, given a current sentence, inferring the previous and inferring the next sentence provide similar supervision power, therefore only one decoder for predicting the next sentence is preserved in our trimmed skip-thought model. Second, we present a connection layer between encoder and decoder to help the model to generalize better on semantic relatedness tasks. Third, we found that a good word embedding initialization is also essential for learning better sentence representations. We train our model unsupervised on a large corpus with contiguous sentences, and then evaluate the trained model on 7 supervised tasks, which includes semantic relatedness, paraphrase detection, and text classification benchmarks. We empirically show that, our proposed model is a faster, lighter-weight and equally powerful alternative to the original skip-thought model.

View on arXiv PDF

Similar