WiRe57 : A Fine-Grained Benchmark for Open Information Extraction
This work provides a domain-specific benchmark for researchers in natural language processing, but it is incremental as it builds on existing Open IE tasks.
The authors tackled the problem of evaluating Open Information Extraction systems by creating WiRe57, a fine-grained benchmark on five documents, and found that MinIE performed best among seven compared extractors.
We build a reference for the task of Open Information Extraction, on five documents. We tentatively resolve a number of issues that arise, including inference and granularity. We seek to better pinpoint the requirements for the task. We produce our annotation guidelines specifying what is correct to extract and what is not. In turn, we use this reference to score existing Open IE systems. We address the non-trivial problem of evaluating the extractions produced by systems against the reference tuples, and share our evaluation script. Among seven compared extractors, we find the MinIE system to perform best.