IRFeb 1, 2022

Improving BERT-based Query-by-Document Retrieval with Multi-Task Optimization

arXiv:2202.00373v235 citations
Originality Incremental advance
AI Analysis

This work addresses retrieval challenges in professional search tasks, but it is incremental as it builds on existing BERT re-rankers.

The paper tackled the problem of improving query-by-document retrieval by extending BERT fine-tuning with a multi-task optimization approach, resulting in significant gains in ranking effectiveness on two benchmarks without altering the model or adding training data.

Query-by-document (QBD) retrieval is an Information Retrieval task in which a seed document acts as the query and the goal is to retrieve related documents -- it is particular common in professional search tasks. In this work we improve the retrieval effectiveness of the BERT re-ranker, proposing an extension to its fine-tuning step to better exploit the context of queries. To this end, we use an additional document-level representation learning objective besides the ranking objective when fine-tuning the BERT re-ranker. Our experiments on two QBD retrieval benchmarks show that the proposed multi-task optimization significantly improves the ranking effectiveness without changing the BERT re-ranker or using additional training samples. In future work, the generalizability of our approach to other retrieval tasks should be further investigated.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes