CLAug 29, 2018

Grammar Induction with Neural Language Models: An Unusual Replication

arXiv:1808.10000v11113 citations
AI Analysis

This work addresses the problem of grammar induction for computational linguistics by validating and extending a neural approach, though it is incremental as it builds on existing methods.

The authors attempted to replicate a prior neural model for latent tree learning and found issues in the original study, but in fair experiments, their model outperformed baselines and was competitive with symbolic systems, showing robust results.

A substantial thread of recent work on latent tree learning has attempted to develop neural network models with parse-valued latent variables and train them on non-parsing tasks, in the hope of having them discover interpretable tree structure. In a recent paper, Shen et al. (2018) introduce such a model and report near-state-of-the-art results on the target task of language modeling, and the first strong latent tree learning result on constituency parsing. In an attempt to reproduce these results, we discover issues that make the original results hard to trust, including tuning and even training on what is effectively the test set. Here, we attempt to reproduce these results in a fair experiment and to extend them to two new datasets. We find that the results of this work are robust: All variants of the model under study outperform all latent tree learning baselines, and perform competitively with symbolic grammar induction systems. We find that this model represents the first empirical success for latent tree learning, and that neural network language modeling warrants further study as a setting for grammar induction.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes