IRSep 11, 2017

A Short Note on Proximity-based Scoring of Documents with Multiple Fields

arXiv:1709.03260v1
AI Analysis

This work addresses document retrieval challenges for search engines by proposing an incremental improvement to existing ranking functions.

The paper tackled the problem of scoring documents with multiple fields by combining BM25F and Expanded Span methods to incorporate term proximity, resulting in a new scoring method for such documents.

The BM25 ranking function is one of the most well known query relevance document scoring functions and many variations of it are proposed. The BM25F function is one of its adaptations designed for modeling documents with multiple fields. The Expanded Span method extends a BM25-like function by taking into considerations of the proximity between term occurrences. In this note, we combine these two variations into one scoring method in view of proximity-based scoring of documents with multiple fields.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes