CLAIMay 23, 2023

Global Structure Knowledge-Guided Relation Extraction Method for Visually-Rich Document

arXiv:2305.13850v3131 citationsHas Code
Originality Incremental advance
AI Analysis

This work addresses a bottleneck in visual relation extraction for document analysis, offering incremental improvements by integrating global structure to enhance model performance and efficiency.

The paper tackles the problem of extracting relationships between entities in visually-rich documents by addressing the neglect of global structural information in existing methods, proposing the GOSE framework which iteratively incorporates global structure knowledge into entity representations, resulting in outperforming existing methods in standard fine-tuning, cross-lingual learning, and low-resource settings.

Visual Relation Extraction (VRE) is a powerful means of discovering relationships between entities within visually-rich documents. Existing methods often focus on manipulating entity features to find pairwise relations, yet neglect the more fundamental structural information that links disparate entity pairs together. The absence of global structure information may make the model struggle to learn long-range relations and easily predict conflicted results. To alleviate such limitations, we propose a GlObal Structure knowledge-guided relation Extraction (GOSE) framework. GOSE initiates by generating preliminary relation predictions on entity pairs extracted from a scanned image of the document. Subsequently, global structural knowledge is captured from the preceding iterative predictions, which are then incorporated into the representations of the entities. This "generate-capture-incorporate" cycle is repeated multiple times, allowing entity representations and global structure knowledge to be mutually reinforced. Extensive experiments validate that GOSE not only outperforms existing methods in the standard fine-tuning setting but also reveals superior cross-lingual learning capabilities; indeed, even yields stronger data-efficient performance in the low-resource setting. The code for GOSE will be available at https://github.com/chenxn2020/GOSE.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes