CLFeb 25, 2020

A more abstractive summarization model

Satyaki Chakraborty, Xinya Li, Sayak Chakraborty

arXiv:2002.10959v10.2

Originality Synthesis-oriented

AI Analysis

This addresses a specific limitation in abstractive summarization models for NLP researchers, but it is incremental as it builds on existing pointer-generator architectures.

The paper tackled the problem of pointer-generator networks in text summarization being unable to generate novel words, and by adding an Out-of-vocabulary (OOV) penalty, it significantly improved novelty/abstraction scores while reporting ROUGE metrics.

Pointer-generator network is an extremely popular method of text summarization. More recent works in this domain still build on top of the baseline pointer generator by augmenting a content selection phase, or by decomposing the decoder into a contextual network and a language model. However, all such models that are based on the pointer-generator base architecture cannot generate novel words in the summary and mostly copy words from the source text. In our work, we first thoroughly investigate why the pointer-generator network is unable to generate novel words, and then address that by adding an Out-of-vocabulary (OOV) penalty. This enables us to improve the amount of novelty/abstraction significantly. We use normalized n-gram novelty scores as a metric for determining the level of abstraction. Moreover, we also report rouge scores of our model since most summarization models are evaluated with R-1, R-2, R-L scores.

View on arXiv PDF

Similar