CLApr 17, 2018

ListOps: A Diagnostic Dataset for Latent Tree Learning

arXiv:1804.06028v133.21168 citations

Originality Synthesis-oriented

AI Analysis

This work addresses a diagnostic challenge for researchers in unsupervised parsing by providing a controlled dataset to test latent tree models, though it is incremental as it focuses on evaluation rather than model improvement.

The authors tackled the problem of evaluating latent tree learning models' parsing ability by introducing ListOps, a toy dataset with a single correct parsing strategy, and found that current leading models perform worse than sequential RNNs on this task.

Latent tree learning models learn to parse a sentence without syntactic supervision, and use that parse to build the sentence representation. Existing work on such models has shown that, while they perform well on tasks like sentence classification, they do not learn grammars that conform to any plausible semantic or syntactic formalism (Williams et al., 2018a). Studying the parsing ability of such models in natural language can be challenging due to the inherent complexities of natural language, like having several valid parses for a single sentence. In this paper we introduce ListOps, a toy dataset created to study the parsing ability of latent tree models. ListOps sequences are in the style of prefix arithmetic. The dataset is designed to have a single correct parsing strategy that a system needs to learn to succeed at the task. We show that the current leading latent tree models are unable to learn to parse and succeed at ListOps. These models achieve accuracies worse than purely sequential RNNs.

View on arXiv PDF

Similar