CL, AI · Dec 6, 2021

A New Sentence Extraction Strategy for Unsupervised Extractive Summarization Methods

Dehao Tao, Yingzhu Xiong, Zhongliang Yang, Yongfeng Huang

arXiv:2112.03203v5

Originality: Synthesis-oriented

AI Analysis

This work addresses the challenge of text summarization without large datasets, but it appears incremental as it builds on existing unsupervised methods.

The paper tackles unsupervised extractive text summarization by proposing a new sentence extraction strategy, grounded in Information Theory, that aims to improve the feature distribution and reduce the mutual information among summary sentences. The reported experiments indicate that the strategy is effective.

In recent years, text summarization has again attracted much attention thanks to research on neural network models. Most current summarization methods based on neural network models are supervised and require large-scale datasets. However, large-scale datasets are difficult to obtain in practical applications. In this paper, we model the task of extractive text summarization from the perspective of Information Theory and then describe unsupervised extractive methods within a uniform framework. To improve the feature distribution and decrease the mutual information of summary sentences, we propose a new sentence extraction strategy that can be applied to existing unsupervised extractive methods. Experiments on different datasets show that our strategy is effective and in line with expectations.
