IROct 18, 2019

The Bitwise Hashing Trick for Personalized Search

arXiv:1910.08646v11 citations
Originality Incremental advance
AI Analysis

This addresses efficiency challenges in personalized search for users, though it is incremental as it builds on existing hashing methods.

The paper tackles the problem of fast lexical comparison for personalized search by introducing feature bit vectors using the hashing trick, resulting in an order of magnitude reduction in data structure size and compute time with only a small accuracy decrease.

Many real world problems require fast and efficient lexical comparison of large numbers of short text strings. Search personalization is one such domain. We introduce the use of feature bit vectors using the hashing trick for improving relevance in personalized search and other personalization applications. We present results of several lexical hashing and comparison methods. These methods are applied to a user's historical behavior and are used to predict future behavior. Using a single bit per dimension instead of floating point results in an order of magnitude decrease in data structure size, while preserving or even improving quality. We use real data to simulate a search personalization task. A simple method for combining bit vectors demonstrates an order of magnitude improvement in compute time on the task with only a small decrease in accuracy.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes