CLLGJan 4, 2023

A comprehensive review of automatic text summarization techniques: method, data, evaluation and coding

arXiv:2301.03403v421 citationsh-index: 35
Originality Synthesis-oriented
AI Analysis

It offers a comprehensive overview for researchers and practitioners in natural language processing, but it is incremental as it synthesizes existing work.

This paper provides a literature review of automatic text summarization techniques, covering methods, datasets, evaluation, and coding, and includes an empirical exploration using the CNN Corpus dataset.

We provide a literature review about Automatic Text Summarization (ATS) systems. We consider a citation-based approach. We start with some popular and well-known papers that we have in hand about each topic we want to cover and we have tracked the "backward citations" (papers that are cited by the set of papers we knew beforehand) and the "forward citations" (newer papers that cite the set of papers we knew beforehand). In order to organize the different methods, we present the diverse approaches to ATS guided by the mechanisms they use to generate a summary. Besides presenting the methods, we also present an extensive review of the datasets available for summarization tasks and the methods used to evaluate the quality of the summaries. Finally, we present an empirical exploration of these methods using the CNN Corpus dataset that provides golden summaries for extractive and abstractive methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes