CLMar 28, 2022

ANNA: Enhanced Language Representation for Question Answering

arXiv:2203.14507v2641 citationsh-index: 10
Originality Incremental advance
AI Analysis

This work addresses the problem of enhancing question answering performance for NLP researchers and practitioners, but it is incremental as it builds on existing pre-training methods with minor extensions.

The paper tackled improving pre-trained language models for question answering by jointly considering data processing, pre-training tasks, neural network modeling, and fine-tuning, achieving state-of-the-art results of 95.7% F1 and 90.6% EM on SQuAD 1.1 and outperforming existing models on SQuAD 2.0.

Pre-trained language models have brought significant improvements in performance in a variety of natural language processing tasks. Most existing models performing state-of-the-art results have shown their approaches in the separate perspectives of data processing, pre-training tasks, neural network modeling, or fine-tuning. In this paper, we demonstrate how the approaches affect performance individually, and that the language model performs the best results on a specific question answering task when those approaches are jointly considered in pre-training models. In particular, we propose an extended pre-training task, and a new neighbor-aware mechanism that attends neighboring tokens more to capture the richness of context for pre-training language modeling. Our best model achieves new state-of-the-art results of 95.7\% F1 and 90.6\% EM on SQuAD 1.1 and also outperforms existing pre-trained language models such as RoBERTa, ALBERT, ELECTRA, and XLNet on the SQuAD 2.0 benchmark.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes