IR · CL · Aug 6, 2020

DeText: A Deep Text Ranking Framework with BERT

arXiv:2008.02460v1 · 30 citations · Has Code
AI Analysis

This paper addresses the problem of slow BERT-based ranking in search systems, offering an incremental efficiency improvement for industry applications.

The paper tackles the inefficiency of BERT-based ranking models in search systems by developing DeText, a ranking framework built for efficient industry use. Offline and online experiments on three real-world search systems show significant improvement over state-of-the-art approaches.

Ranking is the most important component in a search system. Most search systems deal with large amounts of natural language data, hence an effective ranking system requires a deep understanding of text semantics. Recently, deep learning based natural language processing (deep NLP) models have generated promising results on ranking systems. BERT is one of the most successful models that learn contextual embedding, which has been applied to capture complex query-document relations for search ranking. However, this is generally done by exhaustively interacting each query word with each document word, which is inefficient for online serving in search product systems. In this paper, we investigate how to build an efficient BERT-based ranking model for industry use cases. The solution is further extended to a general ranking framework, DeText, that is open sourced and can be applied to various ranking productions. Offline and online experiments of DeText on three real-world search systems present significant improvement over state-of-the-art approaches.
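
The cost the abstract describes can be made concrete with a toy count. The sketch below is only an illustration, not code from DeText: the function names and sample data are hypothetical, and the alternative it compares against (encoding each document to a single vector offline, then comparing it with one query vector online) is an assumption about what an efficient design might look like, since the abstract does not spell out the method.

```python
from typing import List


def interaction_pairs(query: List[str], docs: List[List[str]]) -> int:
    """Count the query-word/document-word pairs an interaction-based
    ranker has to process online for one query."""
    return sum(len(query) * len(doc) for doc in docs)


def representation_comparisons(docs: List[List[str]]) -> int:
    """Count online comparisons under the ASSUMED alternative: each
    document is pre-encoded offline to one vector, so serving needs one
    query-vector/document-vector comparison per document."""
    return len(docs)


# Hypothetical query and candidate documents, tokenized on whitespace.
query = "senior machine learning engineer".split()
docs = [
    "machine learning engineer at a search company".split(),
    "senior software engineer working on ranking and retrieval systems".split(),
    "data scientist".split(),
]

pairs = interaction_pairs(query, docs)
comparisons = representation_comparisons(docs)
print(f"word-pair interactions: {pairs}, vector comparisons: {comparisons}")
```

The count of word pairs grows with both query length and total document length, while the assumed per-document comparison grows only with the number of candidates, which is why exhaustive word-level interaction is costly at serving time.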

Code Implementations · 1 repo