BibRank: Automatic Keyphrase Extraction Platform Using~Metadata
This provides a platform for researchers and developers in natural language processing to enhance keyphrase extraction algorithms, though it is incremental as it builds on existing techniques.
The paper tackles the problem of automatic keyphrase extraction by introducing BibRank, a platform that integrates datasets and an algorithm using bibliographic metadata, resulting in a tool that facilitates evaluation and improvement of extraction methods for researchers and developers.
Automatic Keyphrase Extraction involves identifying essential phrases in a document. These keyphrases are crucial in various tasks such as document classification, clustering, recommendation, indexing, searching, summarization, and text simplification. This paper introduces a platform that integrates keyphrase datasets and facilitates the evaluation of keyphrase extraction algorithms. The platform includes BibRank, an automatic keyphrase extraction algorithm that leverages a rich dataset obtained by parsing bibliographic data in BibTeX format. BibRank combines innovative weighting techniques with positional, statistical, and word co-occurrence information to extract keyphrases from documents. The platform proves valuable for researchers and developers seeking to enhance their keyphrase extraction algorithms and advance the field of natural language processing.