CLOct 19, 2020

Heads-up! Unsupervised Constituency Parsing via Self-Attention Heads

arXiv:2010.09517v1991 citations
Originality Incremental advance
AI Analysis

This provides an unsupervised parsing method for NLP researchers, enabling analysis of syntactic knowledge in PLMs and adaptation to low-resource languages, though it is incremental as it builds on existing attention-based analysis.

The paper tackles the problem of extracting constituency trees from pre-trained language models without supervision by ranking and ensembling attention heads, achieving performance that often surpasses existing approaches when no development set is available.

Transformer-based pre-trained language models (PLMs) have dramatically improved the state of the art in NLP across many tasks. This has led to substantial interest in analyzing the syntactic knowledge PLMs learn. Previous approaches to this question have been limited, mostly using test suites or probes. Here, we propose a novel fully unsupervised parsing approach that extracts constituency trees from PLM attention heads. We rank transformer attention heads based on their inherent properties, and create an ensemble of high-ranking heads to produce the final tree. Our method is adaptable to low-resource languages, as it does not rely on development sets, which can be expensive to annotate. Our experiments show that the proposed method often outperform existing approaches if there is no development set present. Our unsupervised parser can also be used as a tool to analyze the grammars PLMs learn implicitly. For this, we use the parse trees induced by our method to train a neural PCFG and compare it to a grammar derived from a human-annotated treebank.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes