CLSep 16, 2019

BottleSum: Unsupervised and Self-supervised Sentence Summarization using the Information Bottleneck Principle

arXiv:1909.07405v21020 citations
AI Analysis

This addresses the problem of generating sentence summaries without labeled data for NLP researchers and practitioners, though it is incremental as it builds on existing principles and methods.

The paper tackles unsupervised and self-supervised sentence summarization by applying the Information Bottleneck principle to predict the next sentence, resulting in an extractive method that outperforms other unsupervised models on automatic metrics and a self-supervised abstractive model that surpasses baselines in human evaluations.

The principle of the Information Bottleneck (Tishby et al. 1999) is to produce a summary of information X optimized to predict some other relevant information Y. In this paper, we propose a novel approach to unsupervised sentence summarization by mapping the Information Bottleneck principle to a conditional language modelling objective: given a sentence, our approach seeks a compressed sentence that can best predict the next sentence. Our iterative algorithm under the Information Bottleneck objective searches gradually shorter subsequences of the given sentence while maximizing the probability of the next sentence conditioned on the summary. Using only pretrained language models with no direct supervision, our approach can efficiently perform extractive sentence summarization over a large corpus. Building on our unsupervised extractive summarization (BottleSumEx), we then present a new approach to self-supervised abstractive summarization (BottleSumSelf), where a transformer-based language model is trained on the output summaries of our unsupervised method. Empirical results demonstrate that our extractive method outperforms other unsupervised models on multiple automatic metrics. In addition, we find that our self-supervised abstractive model outperforms unsupervised baselines (including our own) by human evaluation along multiple attributes.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes