CLIRLGNov 13, 2021

Keyphrase Extraction Using Neighborhood Knowledge Based on Word Embeddings

arXiv:2111.07198v12 citations
Originality Incremental advance
AI Analysis

This work addresses the limitation of existing keyphrase extraction methods for researchers and practitioners by providing a more accurate way to identify main topics in documents, though it is incremental as it builds on established graph-based models.

The paper tackled the problem of keyphrase extraction by enhancing graph-based ranking models with word embeddings to capture semantic relationships beyond co-occurrence, resulting in improved performance on benchmark datasets.

Keyphrase extraction is the task of finding several interesting phrases in a text document, which provide a list of the main topics within the document. Most existing graph-based models use co-occurrence links as cohesion indicators to model the relationship of syntactic elements. However, a word may have different forms of expression within the document, and may have several synonyms as well. Simply using co-occurrence information cannot capture this information. In this paper, we enhance the graph-based ranking model by leveraging word embeddings as background knowledge to add semantic information to the inter-word graph. Our approach is evaluated on established benchmark datasets and empirical results show that the word embedding neighborhood information improves the model performance.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes