W2KPE: Keyphrase Extraction with Word-Word Relation
This work addresses keyphrase extraction for conference theme relevance, but it is incremental as it builds on existing methods with specific optimizations.
The paper tackled keyphrase extraction from conference materials by modeling it as a Named Entity Recognition task and introduced techniques like data preprocessing, loss function replacement, and scoring methods, achieving a score of 45.04 on the final test set.
This paper describes our submission to ICASSP 2023 MUG Challenge Track 4, Keyphrase Extraction, which aims to extract keyphrases most relevant to the conference theme from conference materials. We model the challenge as a single-class Named Entity Recognition task and developed techniques for better performance on the challenge: For the data preprocessing, we encode the split keyphrases after word segmentation. In addition, we increase the amount of input information that the model can accept at one time by fusing multiple preprocessed sentences into one segment. We replace the loss function with the multi-class focal loss to address the sparseness of keyphrases. Besides, we score each appearance of keyphrases and add an extra output layer to fit the score to rank keyphrases. Exhaustive evaluations are performed to find the best combination of the word segmentation tool, the pre-trained embedding model, and the corresponding hyperparameters. With these proposals, we scored 45.04 on the final test set.