CLMar 24, 2022

Probing for Labeled Dependency Trees

arXiv:2203.12971v1640 citationsh-index: 46
Originality Incremental advance
AI Analysis

This work addresses a specific bottleneck in NLP representation analysis for dependency parsing, offering an incremental improvement in probe design for researchers in computational linguistics.

The paper tackles the limitation of linear probes in extracting only undirected or unlabeled dependency parse trees from embeddings by introducing DepProbe, which extracts labeled and directed trees with fewer parameters and compute, achieving 94% accuracy in identifying the best source treebank for transfer across 13 languages.

Probing has become an important tool for analyzing representations in Natural Language Processing (NLP). For graphical NLP tasks such as dependency parsing, linear probes are currently limited to extracting undirected or unlabeled parse trees which do not capture the full task. This work introduces DepProbe, a linear probe which can extract labeled and directed dependency parse trees from embeddings while using fewer parameters and compute than prior methods. Leveraging its full task coverage and lightweight parametrization, we investigate its predictive power for selecting the best transfer language for training a full biaffine attention parser. Across 13 languages, our proposed method identifies the best source treebank 94% of the time, outperforming competitive baselines and prior work. Finally, we analyze the informativeness of task-specific subspaces in contextual embeddings as well as which benefits a full parser's non-linear parametrization provides.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes