CLAIIRMay 26, 2019

Simple and Effective Curriculum Pointer-Generator Networks for Reading Comprehension over Long Narratives

arXiv:1905.10847v11120 citations
Originality Incremental advance
AI Analysis

This addresses the problem of handling long documents for reading comprehension tasks, offering a novel approach that improves performance for NLP applications, though it is incremental in combining existing techniques.

The paper tackles reading comprehension over long narratives by proposing a curriculum learning-based Pointer-Generator framework and an Introspective Alignment Layer, achieving state-of-the-art performance on the NarrativeQA benchmark with a 51% relative improvement on BLEU-4 and 17% on Rouge-L.

This paper tackles the problem of reading comprehension over long narratives where documents easily span over thousands of tokens. We propose a curriculum learning (CL) based Pointer-Generator framework for reading/sampling over large documents, enabling diverse training of the neural model based on the notion of alternating contextual difficulty. This can be interpreted as a form of domain randomization and/or generative pretraining during training. To this end, the usage of the Pointer-Generator softens the requirement of having the answer within the context, enabling us to construct diverse training samples for learning. Additionally, we propose a new Introspective Alignment Layer (IAL), which reasons over decomposed alignments using block-based self-attention. We evaluate our proposed method on the NarrativeQA reading comprehension benchmark, achieving state-of-the-art performance, improving existing baselines by $51\%$ relative improvement on BLEU-4 and $17\%$ relative improvement on Rouge-L. Extensive ablations confirm the effectiveness of our proposed IAL and CL components.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes