CLMar 24, 2022

Probing for Labeled Dependency Trees

Max Müller-Eberstein, Rob van der Goot, Barbara Plank

arXiv:2203.12971v132.0640 citationsh-index: 46Has Code

Originality Incremental advance

AI Analysis

This work addresses a specific bottleneck in NLP representation analysis for dependency parsing, offering an incremental improvement in probe design for researchers in computational linguistics.

The paper tackles the limitation of linear probes in extracting only undirected or unlabeled dependency parse trees from embeddings by introducing DepProbe, which extracts labeled and directed trees with fewer parameters and compute, achieving 94% accuracy in identifying the best source treebank for transfer across 13 languages.

Probing has become an important tool for analyzing representations in Natural Language Processing (NLP). For graphical NLP tasks such as dependency parsing, linear probes are currently limited to extracting undirected or unlabeled parse trees which do not capture the full task. This work introduces DepProbe, a linear probe which can extract labeled and directed dependency parse trees from embeddings while using fewer parameters and compute than prior methods. Leveraging its full task coverage and lightweight parametrization, we investigate its predictive power for selecting the best transfer language for training a full biaffine attention parser. Across 13 languages, our proposed method identifies the best source treebank 94% of the time, outperforming competitive baselines and prior work. Finally, we analyze the informativeness of task-specific subspaces in contextual embeddings as well as which benefits a full parser's non-linear parametrization provides.

View on arXiv PDF Code

Similar