CLSep 15, 2020

Global-aware Beam Search for Neural Abstractive Summarization

arXiv:2009.06891v513 citations
Originality Incremental advance
AI Analysis

This addresses a key bottleneck in text generation for summarization tasks, though it is an incremental improvement over existing beam search methods.

The study tackled the local optimality problem in beam search for neural abstractive summarization by developing a global-aware algorithm that uses attention distribution to guide inference, resulting in significant improvements on nine datasets with state-of-the-art models.

This study develops a calibrated beam-based algorithm with awareness of the global attention distribution for neural abstractive summarization, aiming to improve the local optimality problem of the original beam search in a rigorous way. Specifically, a novel global protocol is proposed based on the attention distribution to stipulate how a global optimal hypothesis should attend to the source. A global scoring mechanism is then developed to regulate beam search to generate summaries in a near-global optimal fashion. This novel design enjoys a distinctive property, i.e., the global attention distribution could be predicted before inference, enabling step-wise improvements on the beam search through the global scoring mechanism. Extensive experiments on nine datasets show that the global (attention)-aware inference significantly improves state-of-the-art summarization models even using empirical hyper-parameters. The algorithm is also proven robust as it remains to generate meaningful texts with corrupted attention distributions. The codes and a comprehensive set of examples are available.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes