CLOct 10, 2022

Leveraging Key Information Modeling to Improve Less-Data Constrained News Headline Generation via Duality Fine-Tuning

arXiv:2210.04473v1297 citationsh-index: 20
AI Analysis

This work addresses the challenge of less-data constrained generation for news headline creation, which is incremental in improving efficiency and effectiveness in low-resource scenarios.

The paper tackles the problem of generating readable and informative news headlines with limited training data by proposing a duality fine-tuning method that models key information and enforces probabilistic constraints between prediction and generation tasks. The method achieves improved performance on language modeling and informativeness metrics across two public datasets.

Recent language generative models are mostly trained on large-scale datasets, while in some real scenarios, the training datasets are often expensive to obtain and would be small-scale. In this paper we investigate the challenging task of less-data constrained generation, especially when the generated news headlines are short yet expected by readers to keep readable and informative simultaneously. We highlight the key information modeling task and propose a novel duality fine-tuning method by formally defining the probabilistic duality constraints between key information prediction and headline generation tasks. The proposed method can capture more information from limited data, build connections between separate tasks, and is suitable for less-data constrained generation tasks. Furthermore, the method can leverage various pre-trained generative regimes, e.g., autoregressive and encoder-decoder models. We conduct extensive experiments to demonstrate that our method is effective and efficient to achieve improved performance in terms of language modeling metric and informativeness correctness metric on two public datasets.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes