CLJan 18, 2024

Power in Numbers: Robust reading comprehension by finetuning with four adversarial sentences per example

arXiv:2401.10091v1

Originality Incremental advance

AI Analysis

This work addresses the vulnerability of reading comprehension models to adversarial attacks, offering a method to enhance robustness, though it is incremental as it builds on prior adversarial research.

The paper tackles the problem of adversarial attacks on reading comprehension models, where appending adversarial sentences drastically reduces performance; by finetuning ELECTRA-Small on training examples with four or five adversarial sentences, the model achieves over 70% F1 score on most evaluation datasets, demonstrating improved robustness.

Recent models have achieved human level performance on the Stanford Question Answering Dataset when using F1 scores to evaluate the reading comprehension task. Yet, teaching machines to comprehend text has not been solved in the general case. By appending one adversarial sentence to the context paragraph, past research has shown that the F1 scores from reading comprehension models drop almost in half. In this paper, I replicate past adversarial research with a new model, ELECTRA-Small, and demonstrate that the new model's F1 score drops from 83.9% to 29.2%. To improve ELECTRA-Small's resistance to this attack, I finetune the model on SQuAD v1.1 training examples with one to five adversarial sentences appended to the context paragraph. Like past research, I find that the finetuned model on one adversarial sentence does not generalize well across evaluation datasets. However, when finetuned on four or five adversarial sentences the model attains an F1 score of more than 70% on most evaluation datasets with multiple appended and prepended adversarial sentences. The results suggest that with enough examples we can make models robust to adversarial attacks.

View on arXiv PDF

Similar