CLFeb 17, 2022

A Survey of Knowledge-Intensive NLP with Pre-Trained Language Models

Da Yin, Li Dong, Hao Cheng, Xiaodong Liu, Kai-Wei Chang, Furu Wei, Jianfeng Gao

arXiv:2202.08772v16.238 citations

Originality Synthesis-oriented

AI Analysis

This is an incremental survey for NLP practitioners, summarizing existing methods without introducing new techniques.

This paper surveys knowledge-enhanced pre-trained language models (PLMKEs) to address their limitations in handling knowledge-intensive NLP tasks, summarizing progress through three key elements and outlining challenges and future directions.

With the increasing of model capacity brought by pre-trained language models, there emerges boosting needs for more knowledgeable natural language processing (NLP) models with advanced functionalities including providing and making flexible use of encyclopedic and commonsense knowledge. The mere pre-trained language models, however, lack the capacity of handling such knowledge-intensive NLP tasks alone. To address this challenge, large numbers of pre-trained language models augmented with external knowledge sources are proposed and in rapid development. In this paper, we aim to summarize the current progress of pre-trained language model-based knowledge-enhanced models (PLMKEs) by dissecting their three vital elements: knowledge sources, knowledge-intensive NLP tasks, and knowledge fusion methods. Finally, we present the challenges of PLMKEs based on the discussion regarding the three elements and attempt to provide NLP practitioners with potential directions for further research.

View on arXiv PDF

Similar