CLApr 8, 2020

Diverse, Controllable, and Keyphrase-Aware: A Corpus and Method for News Multi-Headline Generation

arXiv:2004.03875v21000 citations
Originality Highly original
AI Analysis

This addresses the need for personalized and varied headline generation in news media, offering a more flexible approach compared to existing single-headline methods.

The paper tackles the problem of generating multiple, diverse headlines for a single news article by incorporating user-specified keyphrases, achieving state-of-the-art results in quality and diversity on a real-world dataset. It introduces a method that first generates keyphrases and then uses a multi-source Transformer decoder to produce headlines, supported by a new large-scale corpus of over 180K aligned triples.

News headline generation aims to produce a short sentence to attract readers to read the news. One news article often contains multiple keyphrases that are of interest to different users, which can naturally have multiple reasonable headlines. However, most existing methods focus on the single headline generation. In this paper, we propose generating multiple headlines with keyphrases of user interests, whose main idea is to generate multiple keyphrases of interest to users for the news first, and then generate multiple keyphrase-relevant headlines. We propose a multi-source Transformer decoder, which takes three sources as inputs: (a) keyphrase, (b) keyphrase-filtered article, and (c) original article to generate keyphrase-relevant, high-quality, and diverse headlines. Furthermore, we propose a simple and effective method to mine the keyphrases of interest in the news article and build a first large-scale keyphrase-aware news headline corpus, which contains over 180K aligned triples of $<$news article, headline, keyphrase$>$. Extensive experimental comparisons on the real-world dataset show that the proposed method achieves state-of-the-art results in terms of quality and diversity

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes