CLAug 4, 2021

Multi-Round Parsing-based Multiword Rules for Scientific OpenIE

arXiv:2108.02074v1
Originality Synthesis-oriented
AI Analysis

This work addresses the challenge of multiword expression boundary identification in scientific OpenIE, but it appears incremental as it builds on existing parsing methods without a major breakthrough.

The authors tackled the problem of extracting structured information from scientific literature without requiring annotated training data by proposing a set of dependency parsing-based rules for OpenIE, showing effectiveness on novel datasets.

Information extraction (IE) in scientific literature has facilitated many down-stream tasks. OpenIE, which does not require any relation schema but identifies a relational phrase to describe the relationship between a subject and an object, is being a trending topic of IE in sciences. The subjects, objects, and relations are often multiword expressions, which brings challenges for methods to identify the boundaries of the expressions given very limited or even no training data. In this work, we present a set of rules for extracting structured information based on dependency parsing that can be applied to any scientific dataset requiring no expert's annotation. Results on novel datasets show the effectiveness of the proposed method. We discuss negative results as well.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes