CL AI NEMar 14, 2017

Making Neural QA as Simple as Possible but not Simpler

Dirk Weissenborn, Georg Wiese, Laura Seiffe

arXiv:1703.04816v316.5221 citations

Originality Incremental advance

AI Analysis

This work addresses the issue of over-engineering in neural QA systems for researchers and practitioners, highlighting that simpler baselines can be effective, making it an incremental contribution.

The authors tackled the problem of unnecessarily complex neural question answering systems by identifying two key requirements for high performance: question-aware context processing and non-bag-of-words composition functions. They showed that FastQA, a simple system meeting these requirements, achieves competitive performance, with results comparable to existing models.

Recent development of large-scale question answering (QA) datasets triggered a substantial amount of research into end-to-end neural architectures for QA. Increasingly complex systems have been conceived without comparison to simpler neural baseline systems that would justify their complexity. In this work, we propose a simple heuristic that guides the development of neural baseline systems for the extractive QA task. We find that there are two ingredients necessary for building a high-performing neural QA system: first, the awareness of question words while processing the context and second, a composition function that goes beyond simple bag-of-words modeling, such as recurrent neural networks. Our results show that FastQA, a system that meets these two requirements, can achieve very competitive performance compared with existing models. We argue that this surprising finding puts results of previous systems and the complexity of recent QA datasets into perspective.

View on arXiv PDF

Similar