CL AISep 2, 2024

Pre-Trained Language Models for Keyphrase Prediction: A Review

Muhammad Umair, Tangina Sultana, Young-Koo Lee

arXiv:2409.01087v14.812 citationsh-index: 13

Originality Synthesis-oriented

AI Analysis

It fills a gap in the literature for researchers in NLP by synthesizing existing work on keyphrase prediction, but is incremental as it is a survey.

This review paper addresses the lack of a comprehensive analysis of pre-trained language models for both keyphrase extraction and generation, providing a unified taxonomy and future directions for these tasks.

Keyphrase Prediction (KP) is essential for identifying keyphrases in a document that can summarize its content. However, recent Natural Language Processing (NLP) advances have developed more efficient KP models using deep learning techniques. The limitation of a comprehensive exploration jointly both keyphrase extraction and generation using pre-trained language models spotlights a critical gap in the literature, compelling our survey paper to bridge this deficiency and offer a unified and in-depth analysis to address limitations in previous surveys. This paper extensively examines the topic of pre-trained language models for keyphrase prediction (PLM-KP), which are trained on large text corpora via different learning (supervisor, unsupervised, semi-supervised, and self-supervised) techniques, to provide respective insights into these two types of tasks in NLP, precisely, Keyphrase Extraction (KPE) and Keyphrase Generation (KPG). We introduce appropriate taxonomies for PLM-KPE and KPG to highlight these two main tasks of NLP. Moreover, we point out some promising future directions for predicting keyphrases.

View on arXiv PDF

Similar