CLAIMay 21, 2021

Pretrained Language Models for Text Generation: A Survey

arXiv:2105.10311v2208 citations
Originality Synthesis-oriented
AI Analysis

It synthesizes existing research for text generation researchers, serving as a pointer to related work, but is incremental as it does not introduce novel methods or findings.

This survey paper provides an overview of pretrained language models (PLMs) for text generation, summarizing advances in adapting PLMs to various inputs and fine-tuning strategies, without presenting new experimental results.

Text generation has become one of the most important yet challenging tasks in natural language processing (NLP). The resurgence of deep learning has greatly advanced this field by neural generation models, especially the paradigm of pretrained language models (PLMs). In this paper, we present an overview of the major advances achieved in the topic of PLMs for text generation. As the preliminaries, we present the general task definition and briefly describe the mainstream architectures of PLMs for text generation. As the core content, we discuss how to adapt existing PLMs to model different input data and satisfy special properties in the generated text. We further summarize several important fine-tuning strategies for text generation. Finally, we present several future directions and conclude this paper. Our survey aims to provide text generation researchers a synthesis and pointer to related research.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes