CLMay 29, 2019

The (Non-)Utility of Structural Features in BiLSTM-based Dependency Parsers

arXiv:1905.12676v21094 citations
Originality Incremental advance
AI Analysis

This work addresses the utility of structural features in neural parsing for NLP researchers, showing incremental insights into model behavior.

The study investigated whether BiLSTM-based dependency parsers implicitly capture structural context, finding that explicit structural features become redundant and that implicit structural information significantly influences parser performance.

Classical non-neural dependency parsers put considerable effort on the design of feature functions. Especially, they benefit from information coming from structural features, such as features drawn from neighboring tokens in the dependency tree. In contrast, their BiLSTM-based successors achieve state-of-the-art performance without explicit information about the structural context. In this paper we aim to answer the question: How much structural context are the BiLSTM representations able to capture implicitly? We show that features drawn from partial subtrees become redundant when the BiLSTMs are used. We provide a deep insight into information flow in transition- and graph-based neural architectures to demonstrate where the implicit information comes from when the parsers make their decisions. Finally, with model ablations we demonstrate that the structural context is not only present in the models, but it significantly influences their performance.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes