CL AI HC
Jun 29, 2022

longhorns at DADC 2022: How many linguists does it take to fool a Question Answering model? A systematic approach to adversarial attacks

Venelin Kovatchev, Trina Chatterjee, Venkata S Govindarajan, Jifan Chen, Eunsol Choi, Gabriella Chronis, Anubrata Das, Katrin Erk, Matthew Lease, Junyi Jessy Li, Yating Wu, Kyle Mahowald

arXiv:2206.14729v1 31.8633 citations h-index: 35

Originality: Synthesis-oriented

AI Analysis

This work aims to improve NLP robustness through adversarial data collection. Its contribution is incremental: it applies existing adversarial methods within a single shared-task challenge.

The paper describes how the team manually wrote adversarial questions to fool an extractive question answering model, finishing first in DADC Task 1 with a model error rate of 62%.

Developing methods to adversarially challenge NLP systems is a promising avenue for improving both model performance and interpretability. Here, we describe the approach of the team "longhorns" on Task 1 of The First Workshop on Dynamic Adversarial Data Collection (DADC), which asked teams to manually fool a model on an Extractive Question Answering task. Our team finished first, with a model error rate of 62%. We advocate for a systematic, linguistically informed approach to formulating adversarial questions, and we describe the results of our pilot experiments, as well as our official submission.
