IR CLDec 10, 2021

Match Your Words! A Study of Lexical Matching in Neural Information Retrieval

Thibault Formal, Benjamin Piwowarski, Stéphane Clinchant

arXiv:2112.05662v211.232 citations

Originality Incremental advance

AI Analysis

This addresses a key limitation in neural IR models for search engines, highlighting their poor generalization compared to lexical methods, which is incremental as it builds on existing benchmarks like BEIR.

The study investigated whether neural information retrieval models can effectively perform lexical matching, particularly for important terms, and found that they fail to properly generalize term importance on out-of-domain collections or terms rarely seen during training.

Neural Information Retrieval models hold the promise to replace lexical matching models, e.g. BM25, in modern search engines. While their capabilities have fully shone on in-domain datasets like MS MARCO, they have recently been challenged on out-of-domain zero-shot settings (BEIR benchmark), questioning their actual generalization capabilities compared to bag-of-words approaches. Particularly, we wonder if these shortcomings could (partly) be the consequence of the inability of neural IR models to perform lexical matching off-the-shelf. In this work, we propose a measure of discrepancy between the lexical matching performed by any (neural) model and an 'ideal' one. Based on this, we study the behavior of different state-of-the-art neural IR models, focusing on whether they are able to perform lexical matching when it's actually useful, i.e. for important terms. Overall, we show that neural IR models fail to properly generalize term importance on out-of-domain collections or terms almost unseen during training

View on arXiv PDF

Similar