CLNov 1, 2020

Bracketing Encodings for 2-Planar Dependency Parsing

arXiv:2011.00596v20.00995 citations
AI Analysis55

This work addresses a limitation in dependency parsing for computational linguistics, enabling more accurate handling of non-projective syntactic structures, though it is incremental as it builds on existing bracketing and 2-planarity concepts.

The authors tackled the problem of representing non-projective dependency trees in sequence labeling parsing by introducing a bracketing-based encoding for 2-planar trees, achieving almost total coverage (99.9%) of crossing arcs and improving accuracy by 0.4 LAS on average in highly non-projective treebanks.

We present a bracketing-based encoding that can be used to represent any 2-planar dependency tree over a sentence of length n as a sequence of n labels, hence providing almost total coverage of crossing arcs in sequence labeling parsing. First, we show that existing bracketing encodings for parsing as labeling can only handle a very mild extension of projective trees. Second, we overcome this limitation by taking into account the well-known property of 2-planarity, which is present in the vast majority of dependency syntactic structures in treebanks, i.e., the arcs of a dependency tree can be split into two planes such that arcs in a given plane do not cross. We take advantage of this property to design a method that balances the brackets and that encodes the arcs belonging to each of those planes, allowing for almost unrestricted non-projectivity (round 99.9% coverage) in sequence labeling parsing. The experiments show that our linearizations improve over the accuracy of the original bracketing encoding in highly non-projective treebanks (on average by 0.4 LAS), while achieving a similar speed. Also, they are especially suitable when PoS tags are not used as input parameters to the models.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes