IR | CL | LG | Aug 15, 2019

Hamming Sentence Embeddings for Information Retrieval

arXiv:1908.05541v1 | 1 citation
AI Analysis

This work addresses memory and speed improvements in retrieval applications, though it is incremental as it builds on existing hashing and embedding methods.

The paper tackles the problem of compressing sentence embeddings for information retrieval by using a neural encoder-decoder to produce binary hashes in Hamming space, achieving comparable performance to uncompressed embeddings with compression ratios up to 256:1 on semantic similarity benchmarks.

In retrieval applications, binary hashes are known to offer significant improvements in terms of both memory and speed. We investigate the compression of sentence embeddings using a neural encoder-decoder architecture, which is trained by minimizing reconstruction error. Instead of employing the original real-valued embeddings, we use latent representations in Hamming space produced by the encoder for similarity calculations. In quantitative experiments on several benchmarks for semantic similarity tasks, we show that our compressed Hamming embeddings yield performance comparable to that of uncompressed embeddings (Sent2Vec, InferSent, GloVe-BoW), at compression ratios of up to 256:1. We further demonstrate that our model strongly decorrelates input features, and that the compressor generalizes well when pre-trained on Wikipedia sentences. We publish the source code on GitHub, along with all experimental results.
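To make the idea of similarity search in Hamming space concrete, here is a minimal sketch. It is not the paper's method: the learned encoder is replaced by a simple sign threshold (an assumption made purely for illustration), and the function names `binarize`, `hamming_distances`, and `compression_ratio`, the random data, and the dimensions are all hypothetical. It only shows how packed binary codes can be compared by counting differing bits, and how a compression ratio such as 256:1 can be computed.

```python
import numpy as np


def binarize(embeddings: np.ndarray) -> np.ndarray:
    """Threshold real-valued vectors at zero and pack 8 bits into each byte.

    Stand-in for a learned encoder; assumed here for illustration only.
    """
    bits = (embeddings > 0).astype(np.uint8)
    return np.packbits(bits, axis=-1)


def hamming_distances(query: np.ndarray, codes: np.ndarray) -> np.ndarray:
    """Count differing bits between one packed query code and each row of codes."""
    xor = np.bitwise_xor(codes, query)  # bits that differ are set to 1
    return np.unpackbits(xor, axis=-1).sum(axis=-1)


def compression_ratio(n_dims: int, n_bits: int, bits_per_float: int = 32) -> float:
    """Ratio of the float storage size to the binary code size."""
    return n_dims * bits_per_float / n_bits


# Hypothetical data: 1000 random 64-dimensional float32 "embeddings".
rng = np.random.default_rng(0)
corpus = rng.standard_normal((1000, 64)).astype(np.float32)

codes = binarize(corpus)                  # shape (1000, 8), dtype uint8
dists = hamming_distances(codes[42], codes)
ranking = np.argsort(dists)               # nearest codes first

# Hypothetical sizes: a 1024-dim float32 vector stored as a 128-bit code.
ratio = compression_ratio(n_dims=1024, n_bits=128)
```

The XOR-and-count step is what makes binary codes attractive for retrieval: it needs no floating-point arithmetic, and the packed codes occupy a small fraction of the original memory. With the hypothetical sizes above, 1024 x 32 bits compressed to 128 bits gives a ratio of 256:1.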

