IR CLAug 6, 2020

DeText: A Deep Text Ranking Framework with BERT

Weiwei Guo, Xiaowei Liu, Sida Wang, Huiji Gao, Ananth Sankar, Zimeng Yang, Qi Guo, Liang Zhang, Bo Long, Bee-Chung Chen, Deepak Agarwal

arXiv:2008.02460v114.430 citationsHas Code

Originality Incremental advance

AI Analysis

This addresses the problem of slow BERT-based ranking for search systems, offering an incremental efficiency improvement for industry applications.

The paper tackles the inefficiency of BERT-based ranking models in search systems by developing DeText, a framework that improves efficiency for industry use, with offline and online experiments showing significant improvements over state-of-the-art approaches.

Ranking is the most important component in a search system. Mostsearch systems deal with large amounts of natural language data,hence an effective ranking system requires a deep understandingof text semantics. Recently, deep learning based natural languageprocessing (deep NLP) models have generated promising results onranking systems. BERT is one of the most successful models thatlearn contextual embedding, which has been applied to capturecomplex query-document relations for search ranking. However,this is generally done by exhaustively interacting each query wordwith each document word, which is inefficient for online servingin search product systems. In this paper, we investigate how tobuild an efficient BERT-based ranking model for industry use cases.The solution is further extended to a general ranking framework,DeText, that is open sourced and can be applied to various rankingproductions. Offline and online experiments of DeText on threereal-world search systems present significant improvement overstate-of-the-art approaches.

View on arXiv PDF Code

Similar