CLMay 16, 2024

Hierarchical Attention Graph for Scientific Document Summarization in Global and Local Level

Chenlong Zhao, Xiwen Zhou, Xiaopeng Xie, Yong Zhang

arXiv:2405.10202v131 citationsh-index: 1Has CodeNAACL-HLT

Originality Incremental advance

AI Analysis

This addresses the problem of insufficient semantic learning in extractive summarization for scientific documents, though it appears incremental as it builds on existing graph-based methods.

The paper tackles the challenge of summarizing long scientific documents by simultaneously modeling global and local relations, proposing HAESum which uses graph neural networks and achieves effective results on two benchmark datasets.

Scientific document summarization has been a challenging task due to the long structure of the input text. The long input hinders the simultaneous effective modeling of both global high-order relations between sentences and local intra-sentence relations which is the most critical step in extractive summarization. However, existing methods mostly focus on one type of relation, neglecting the simultaneous effective modeling of both relations, which can lead to insufficient learning of semantic representations. In this paper, we propose HAESum, a novel approach utilizing graph neural networks to locally and globally model documents based on their hierarchical discourse structure. First, intra-sentence relations are learned using a local heterogeneous graph. Subsequently, a novel hypergraph self-attention layer is introduced to further enhance the characterization of high-order inter-sentence relations. We validate our approach on two benchmark datasets, and the experimental results demonstrate the effectiveness of HAESum and the importance of considering hierarchical structures in modeling long scientific documents. Our code will be available at \url{https://github.com/MoLICHENXI/HAESum}

View on arXiv PDF Code

Similar