CLFeb 18, 2020

Interpretable Multi-Headed Attention for Abstractive Summarization at Controllable Lengths

arXiv:2002.07845v2994 citations
AI Analysis

This addresses the need for interpretable and length-controllable summarization in scenarios with limited data, though it is incremental as it builds on existing attention mechanisms.

The paper tackles the problem of generating abstractive summaries at controllable lengths, especially in low-resource domains, by proposing a method that outperforms strong baselines by up to 14.70% in METEOR score.

Abstractive summarization at controllable lengths is a challenging task in natural language processing. It is even more challenging for domains where limited training data is available or scenarios in which the length of the summary is not known beforehand. At the same time, when it comes to trusting machine-generated summaries, explaining how a summary was constructed in human-understandable terms may be critical. We propose Multi-level Summarizer (MLS), a supervised method to construct abstractive summaries of a text document at controllable lengths. The key enabler of our method is an interpretable multi-headed attention mechanism that computes attention distribution over an input document using an array of timestep independent semantic kernels. Each kernel optimizes a human-interpretable syntactic or semantic property. Exhaustive experiments on two low-resource datasets in the English language show that MLS outperforms strong baselines by up to 14.70% in the METEOR score. Human evaluation of the summaries also suggests that they capture the key concepts of the document at various length-budgets.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes