CLJun 12, 2017

Exploring the Syntactic Abilities of RNNs with Multi-task Learning

arXiv:1706.03542v11114 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of improving syntactic representation in RNNs for natural language processing, though it is incremental as it builds on prior methods.

The study tackled the problem of RNNs' syntactic limitations in complex sentences by using multi-task learning with additional tasks like CCG supertagging or language modeling, resulting in significantly lower error rates, particularly on complex sentences.

Recent work has explored the syntactic abilities of RNNs using the subject-verb agreement task, which diagnoses sensitivity to sentence structure. RNNs performed this task well in common cases, but faltered in complex sentences (Linzen et al., 2016). We test whether these errors are due to inherent limitations of the architecture or to the relatively indirect supervision provided by most agreement dependencies in a corpus. We trained a single RNN to perform both the agreement task and an additional task, either CCG supertagging or language modeling. Multi-task training led to significantly lower error rates, in particular on complex sentences, suggesting that RNNs have the ability to evolve more sophisticated syntactic representations than shown before. We also show that easily available agreement training data can improve performance on other syntactic tasks, in particular when only a limited amount of training data is available for those tasks. The multi-task paradigm can also be leveraged to inject grammatical knowledge into language models.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes