IR CL LGJul 20, 2020

Conformer-Kernel with Query Term Independence for Document Retrieval

Bhaskar Mitra, Sebastian Hofstatter, Hamed Zamani, Nick Craswell

arXiv:2007.10434v114.416 citationsHas Code

Originality Incremental advance

AI Analysis

This work addresses efficiency challenges in neural retrieval systems for applications handling long documents, though it appears incremental.

The authors tackled the problem of scaling Transformer-Kernel models for full document retrieval by proposing a Conformer layer with linear memory scaling and incorporating query term independence, achieving preliminary results showing improved viability for long documents.

The Transformer-Kernel (TK) model has demonstrated strong reranking performance on the TREC Deep Learning benchmark---and can be considered to be an efficient (but slightly less effective) alternative to BERT-based ranking models. In this work, we extend the TK architecture to the full retrieval setting by incorporating the query term independence assumption. Furthermore, to reduce the memory complexity of the Transformer layers with respect to the input sequence length, we propose a new Conformer layer. We show that the Conformer's GPU memory requirement scales linearly with input sequence length, making it a more viable option when ranking long documents. Finally, we demonstrate that incorporating explicit term matching signal into the model can be particularly useful in the full retrieval setting. We present preliminary results from our work in this paper.

View on arXiv PDF Code

Similar