CLLGJun 24, 2019

Learning Latent Trees with Stochastic Perturbations and Differentiable Dynamic Programming

arXiv:1906.09992v11099 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of inducing syntactic structures for downstream NLP tasks in an unsupervised manner, though it appears incremental as it builds on prior latent tree learning methods.

The paper tackles the problem of learning latent projective dependency trees without direct supervision by using Gumbel perturbations and differentiable dynamic programming, achieving effectiveness in sentiment analysis and natural language inference tasks.

We treat projective dependency trees as latent variables in our probabilistic model and induce them in such a way as to be beneficial for a downstream task, without relying on any direct tree supervision. Our approach relies on Gumbel perturbations and differentiable dynamic programming. Unlike previous approaches to latent tree learning, we stochastically sample global structures and our parser is fully differentiable. We illustrate its effectiveness on sentiment analysis and natural language inference tasks. We also study its properties on a synthetic structure induction task. Ablation studies emphasize the importance of both stochasticity and constraining latent structures to be projective trees.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes