CL AI Nov 5, 2019

A Joint Model for Definition Extraction with Syntactic Connection and Semantic Consistency

Amir Pouran Ben Veyseh, Franck Dernoncourt, Dejing Dou, Thien Huu Nguyen

arXiv:1911.01678v43.245 citations Has Code

Originality: Incremental advance

AI Analysis

This work addresses a specific bottleneck in Information Extraction by integrating two formulations of Definition Extraction (DE) that were previously handled separately: sentence classification and sequential labeling. It appears incremental in that it builds on existing DE methods and adds architectural enhancements to them.

The paper tackles Definition Extraction (DE) with a joint model that performs sentence classification and sequential labeling simultaneously, so that each task can benefit from the other. The model uses deep learning architectures that exploit syntactic connections and semantic consistency between terms and definitions to improve the representation vectors used for DE.
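To make the two formulations concrete, the following minimal sketch shows how a sentence-level label can be derived from token-level tags. The BIO tag names (`B-Term`, `I-Definition`, and so on), the helper names, and the example sentence are assumptions chosen for illustration; they are not taken from the paper or its code.

```python
from typing import List, Tuple


def spans_from_bio(tags: List[str]) -> List[Tuple[str, int, int]]:
    """Collect (type, start, end_exclusive) spans from a BIO tag sequence.

    Tag names are hypothetical. A stray I- tag that does not continue
    the current span is ignored rather than opening a new span.
    """
    spans: List[Tuple[str, int, int]] = []
    start, kind = None, None
    for i, tag in enumerate(tags + ["O"]):  # sentinel closes a trailing span
        continues = tag.startswith("I-") and kind == tag[2:]
        if not continues:
            if kind is not None:
                spans.append((kind, start, i))
            if tag.startswith("B-"):
                start, kind = i, tag[2:]
            else:
                start, kind = None, None
    return spans


def sentence_label(tags: List[str]) -> bool:
    """Sentence classification view: does the sentence hold a term-definition pair?"""
    kinds = {kind for kind, _, _ in spans_from_bio(tags)}
    return {"Term", "Definition"} <= kinds


# Sequential labeling view of one sentence (illustrative example).
tokens = ["A", "graph", "is", "a", "set", "of", "nodes", "and", "edges", "."]
tags = ["O", "B-Term", "O", "B-Definition", "I-Definition", "I-Definition",
        "I-Definition", "I-Definition", "I-Definition", "O"]

for kind, start, end in spans_from_bio(tags):
    print(kind, " ".join(tokens[start:end]))
print("contains definition:", sentence_label(tags))
```

The sketch only shows that the sentence label is a function of the token labels, which is the kind of inter-dependency a joint model can exploit; a learned model would predict both outputs from shared representations rather than derive one from the other.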

Definition Extraction (DE) is a well-known topic in Information Extraction that aims to identify terms and their corresponding definitions in unstructured text. The task can be formalized either as sentence classification (i.e., deciding whether a sentence contains a term-definition pair) or as sequential labeling (i.e., identifying the boundaries of the terms and definitions). Previous work on DE has focused on only one of the two approaches, failing to model the inter-dependencies between the two tasks. In this work, we propose a novel model for DE that performs the two tasks simultaneously in a single framework to benefit from their inter-dependencies. Our model features deep learning architectures that exploit the global structures of the input sentences as well as the semantic consistencies between the terms and the definitions, thereby improving the quality of the representation vectors for DE. Besides the joint inference between sentence classification and sequential labeling, the proposed model differs fundamentally from prior work on DE in that prior work has employed only the local structures of the input sentences (i.e., word-to-word relations) and has not yet considered the semantic consistencies between terms and definitions. To implement these ideas, our model presents a multi-task learning framework that employs graph convolutional neural networks and predicts the dependency paths between the terms and the definitions. We also seek to enforce consistency between the representations of the terms and definitions both globally (i.e., increasing the semantic consistency between the representations of the entire sentences and the terms/definitions) and locally (i.e., promoting the similarity between the representations of the terms and the definitions).
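As a rough illustration of two ingredients named in the abstract, the sketch below shows one graph-convolution step over dependency edges and a multi-task objective that adds a term-definition consistency penalty. This is not the paper's formulation. The mean aggregation, the cosine-based penalty, the weights `alpha` and `beta`, and all function names are assumptions. The dependency-path prediction loss and the global (sentence-level) consistency term are omitted.

```python
import math
from typing import List, Tuple

Vector = List[float]


def gcn_layer(h: List[Vector], edges: List[Tuple[int, int]],
              w: List[Vector]) -> List[Vector]:
    """One simplified graph-convolution step over an undirected dependency graph.

    h:     one input vector per token
    edges: (head, dependent) token index pairs from a dependency parse
    w:     weight matrix of shape (dim_in, dim_out), a hypothetical parameter
    """
    n, dim_in, dim_out = len(h), len(h[0]), len(w[0])
    neighbors = [{i} for i in range(n)]  # self-loops keep each token's own vector
    for a, b in edges:
        neighbors[a].add(b)
        neighbors[b].add(a)
    out = []
    for i in range(n):
        # Average the vectors of the token and its syntactic neighbors.
        mean = [sum(h[j][d] for j in neighbors[i]) / len(neighbors[i])
                for d in range(dim_in)]
        # Linear map followed by ReLU.
        out.append([max(0.0, sum(mean[d] * w[d][k] for d in range(dim_in)))
                    for k in range(dim_out)])
    return out


def cosine(u: Vector, v: Vector) -> float:
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def joint_loss(cls_loss: float, tag_loss: float,
               term_vec: Vector, def_vec: Vector,
               alpha: float = 1.0, beta: float = 1.0) -> float:
    """Weighted sum of both task losses plus a local consistency penalty.

    The penalty (1 - cosine) shrinks as the term and definition
    representations become more similar. alpha and beta are assumed weights.
    """
    consistency = 1.0 - cosine(term_vec, def_vec)
    return tag_loss + alpha * cls_loss + beta * consistency


# Toy usage: three tokens in a chain 0 - 1 - 2, identity weight matrix.
h0 = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
h1 = gcn_layer(h0, edges=[(0, 1), (1, 2)], w=[[1.0, 0.0], [0.0, 1.0]])
loss = joint_loss(cls_loss=0.5, tag_loss=1.0, term_vec=h1[0], def_vec=h1[2])
```

Because the graph step mixes each token with its syntactic neighbors, stacking such layers lets information travel along dependency paths rather than only between adjacent words, which is the sense in which the abstract contrasts global with local sentence structure.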
