CLMay 20, 2012

Precision-biased Parsing and High-Quality Parse Selection

arXiv:1205.4387v13 citations
Originality Incremental advance
AI Analysis

This work improves parsing reliability for NLP applications by enabling selective high-accuracy outputs, though it is incremental in its approach.

The paper tackles the problem of dependency parsing by introducing precision-biased parsing, which allows abstention on uncertain decisions to prioritize accuracy, achieving over 96% accuracy on 84% of tokens. It also addresses high-quality parse selection, selecting over a third of trees with 97% accuracy.

We introduce precision-biased parsing: a parsing task which favors precision over recall by allowing the parser to abstain from decisions deemed uncertain. We focus on dependency-parsing and present an ensemble method which is capable of assigning parents to 84% of the text tokens while being over 96% accurate on these tokens. We use the precision-biased parsing task to solve the related high-quality parse-selection task: finding a subset of high-quality (accurate) trees in a large collection of parsed text. We present a method for choosing over a third of the input trees while keeping unlabeled dependency parsing accuracy of 97% on these trees. We also present a method which is not based on an ensemble but rather on directly predicting the risk associated with individual parser decisions. In addition to its efficiency, this method demonstrates that a parsing system can provide reasonable estimates of confidence in its predictions without relying on ensembles or aggregate corpus counts.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes