CLAINEMar 14, 2017

Making Neural QA as Simple as Possible but not Simpler

arXiv:1703.04816v3221 citations
Originality Incremental advance
AI Analysis

This work addresses the issue of over-engineering in neural QA systems for researchers and practitioners, highlighting that simpler baselines can be effective, making it an incremental contribution.

The authors tackled the problem of unnecessarily complex neural question answering systems by identifying two key requirements for high performance: question-aware context processing and non-bag-of-words composition functions. They showed that FastQA, a simple system meeting these requirements, achieves competitive performance, with results comparable to existing models.

Recent development of large-scale question answering (QA) datasets triggered a substantial amount of research into end-to-end neural architectures for QA. Increasingly complex systems have been conceived without comparison to simpler neural baseline systems that would justify their complexity. In this work, we propose a simple heuristic that guides the development of neural baseline systems for the extractive QA task. We find that there are two ingredients necessary for building a high-performing neural QA system: first, the awareness of question words while processing the context and second, a composition function that goes beyond simple bag-of-words modeling, such as recurrent neural networks. Our results show that FastQA, a system that meets these two requirements, can achieve very competitive performance compared with existing models. We argue that this surprising finding puts results of previous systems and the complexity of recent QA datasets into perspective.

Code Implementations3 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes