Learning Open Information Extraction of Implicit Relations from Reading Comprehension Datasets
This addresses the low recall issue for implicit relation extraction in OpenIE, which is important for NLP applications like knowledge base construction, but it is incremental as it builds on existing datasets and methods.
The paper tackled the problem of extracting implicit relations from sentences, where relationships between entities are implied rather than explicitly stated, by creating a new dataset 35 times larger than previous ones using reading comprehension data, and a baseline neural model trained on this data outperformed prior methods on the implicit extraction task.
The relationship between two entities in a sentence is often implied by word order and common sense, rather than an explicit predicate. For example, it is evident that "Fed chair Powell indicates rate hike" implies (Powell, is a, Fed chair) and (Powell, works for, Fed). These tuples are just as significant as the explicit-predicate tuple (Powell, indicates, rate hike), but have much lower recall under traditional Open Information Extraction (OpenIE) systems. Implicit tuples are our term for this type of extraction where the relation is not present in the input sentence. There is very little OpenIE training data available relative to other NLP tasks and none focused on implicit relations. We develop an open source, parse-based tool for converting large reading comprehension datasets to OpenIE datasets and release a dataset 35x larger than previously available by sentence count. A baseline neural model trained on this data outperforms previous methods on the implicit extraction task.