Pynsett: A programmable relation extractor
This method addresses the problem of relation extraction for users needing to define custom rules in specialized domains, though it is incremental as it builds on existing semantic parsing techniques.
The paper tackles the challenge of relation extraction by introducing a programmable method that uses semantic graphs and plain English rules, enabling flexible extraction of specialized ontologies from limited document collections without requiring large training corpora.
This paper proposes a programmable relation extraction method for the English language by parsing texts into semantic graphs. A person can define rules in plain English that act as matching patterns onto the graph representation. These rules are designed to capture the semantic content of the documents, allowing for flexibility and ad-hoc entities. Relation extraction is a complex task that typically requires sizable training corpora. The method proposed here is ideal for extracting specialized ontologies in a limited collection of documents.