IRApr 6, 2017

Fixed versus Dynamic Co-Occurrence Windows in TextRank Term Weights for Information Retrieval

arXiv:1704.01851v15 citations
Originality Incremental advance
AI Analysis

This work addresses the need for better ranking functions in information retrieval, but it is incremental as it modifies an existing method (TextRank) with a specific adjustment.

The paper tackled the problem of improving information retrieval precision by proposing dynamically adjusted co-occurrence windows in TextRank term weights, which led to gains in early precision on two IR collections.

TextRank is a variant of PageRank typically used in graphs that represent documents, and where vertices denote terms and edges denote relations between terms. Quite often the relation between terms is simple term co-occurrence within a fixed window of k terms. The output of TextRank when applied iteratively is a score for each vertex, i.e. a term weight, that can be used for information retrieval (IR) just like conventional term frequency based term weights. So far, when computing TextRank term weights over co- occurrence graphs, the window of term co-occurrence is al- ways ?xed. This work departs from this, and considers dy- namically adjusted windows of term co-occurrence that fol- low the document structure on a sentence- and paragraph- level. The resulting TextRank term weights are used in a ranking function that re-ranks 1000 initially returned search results in order to improve the precision of the ranking. Ex- periments with two IR collections show that adjusting the vicinity of term co-occurrence when computing TextRank term weights can lead to gains in early precision.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes