CLMay 16, 2024

Hierarchical Attention Graph for Scientific Document Summarization in Global and Local Level

arXiv:2405.10202v131 citationsh-index: 1Has CodeNAACL-HLT
Originality Incremental advance
AI Analysis

This addresses the problem of insufficient semantic learning in extractive summarization for scientific documents, though it appears incremental as it builds on existing graph-based methods.

The paper tackles the challenge of summarizing long scientific documents by simultaneously modeling global and local relations, proposing HAESum which uses graph neural networks and achieves effective results on two benchmark datasets.

Scientific document summarization has been a challenging task due to the long structure of the input text. The long input hinders the simultaneous effective modeling of both global high-order relations between sentences and local intra-sentence relations which is the most critical step in extractive summarization. However, existing methods mostly focus on one type of relation, neglecting the simultaneous effective modeling of both relations, which can lead to insufficient learning of semantic representations. In this paper, we propose HAESum, a novel approach utilizing graph neural networks to locally and globally model documents based on their hierarchical discourse structure. First, intra-sentence relations are learned using a local heterogeneous graph. Subsequently, a novel hypergraph self-attention layer is introduced to further enhance the characterization of high-order inter-sentence relations. We validate our approach on two benchmark datasets, and the experimental results demonstrate the effectiveness of HAESum and the importance of considering hierarchical structures in modeling long scientific documents. Our code will be available at \url{https://github.com/MoLICHENXI/HAESum}

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes