CLAILGMay 20, 2018

Learning compositionally through attentive guidance

arXiv:1805.09657v431 citations
Originality Incremental advance
AI Analysis

This addresses the issue of systematic compositionality in neural networks for researchers, though it is incremental as it builds on existing sequence-to-sequence models.

The paper tackled the problem of neural networks struggling with compositional generalization by introducing Attentive Guidance, a mechanism that helps sequence-to-sequence models find compositional solutions, showing that guided versions fit training and testing distributions almost equally well and generalize even when distributions diverge.

While neural network models have been successfully applied to domains that require substantial generalisation skills, recent studies have implied that they struggle when solving the task they are trained on requires inferring its underlying compositional structure. In this paper, we introduce Attentive Guidance, a mechanism to direct a sequence to sequence model equipped with attention to find more compositional solutions. We test it on two tasks, devised precisely to assess the compositional capabilities of neural models, and we show that vanilla sequence to sequence models with attention overfit the training distribution, while the guided versions come up with compositional solutions that fit the training and testing distributions almost equally well. Moreover, the learned solutions generalise even in cases where the training and testing distributions strongly diverge. In this way, we demonstrate that sequence to sequence models are capable of finding compositional solutions without requiring extra components. These results helps to disentangle the causes for the lack of systematic compositionality in neural networks, which can in turn fuel future work.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes