CLLGAug 28, 2018

Unsupervised Learning of Syntactic Structure with Invertible Neural Projections

arXiv:1808.09111v11111 citations
Originality Highly original
AI Analysis

This work addresses the problem of unsupervised syntactic analysis for natural language processing, offering incremental improvements in specific tasks.

The paper tackles unsupervised learning of syntactic structure by proposing a generative model that jointly learns discrete syntax and continuous word representations using invertible neural networks, achieving state-of-the-art results on POS induction and unsupervised dependency parsing on the Penn Treebank without gold annotations.

Unsupervised learning of syntactic structure is typically performed using generative models with discrete latent variables and multinomial parameters. In most cases, these models have not leveraged continuous word representations. In this work, we propose a novel generative model that jointly learns discrete syntactic structure and continuous word representations in an unsupervised fashion by cascading an invertible neural network with a structured generative prior. We show that the invertibility condition allows for efficient exact inference and marginal likelihood computation in our model so long as the prior is well-behaved. In experiments we instantiate our approach with both Markov and tree-structured priors, evaluating on two tasks: part-of-speech (POS) induction, and unsupervised dependency parsing without gold POS annotation. On the Penn Treebank, our Markov-structured model surpasses state-of-the-art results on POS induction. Similarly, we find that our tree-structured model achieves state-of-the-art performance on unsupervised dependency parsing for the difficult training condition where neither gold POS annotation nor punctuation-based constraints are available.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes