CL AI LGNov 12, 2024

Deceiving Question-Answering Models: A Hybrid Word-Level Adversarial Approach

Jiyao Li, Mingze Ni, Yongshun Gong, Wei Liu

arXiv:2411.08248v11.01 citationsh-index: 4Has Code

Originality Incremental advance

AI Analysis

This addresses the robustness of QA models for NLP applications, representing an incremental improvement in adversarial attack methods.

The paper tackles the problem of adversarial attacks on question-answering models by introducing QA-Attack, a word-level strategy that uses attention mechanisms and synonym substitution to deceive models, achieving high success rates and outperforming existing techniques in metrics like BLEU score and grammar error rate.

Deep learning underpins most of the currently advanced natural language processing (NLP) tasks such as textual classification, neural machine translation (NMT), abstractive summarization and question-answering (QA). However, the robustness of the models, particularly QA models, against adversarial attacks is a critical concern that remains insufficiently explored. This paper introduces QA-Attack (Question Answering Attack), a novel word-level adversarial strategy that fools QA models. Our attention-based attack exploits the customized attention mechanism and deletion ranking strategy to identify and target specific words within contextual passages. It creates deceptive inputs by carefully choosing and substituting synonyms, preserving grammatical integrity while misleading the model to produce incorrect responses. Our approach demonstrates versatility across various question types, particularly when dealing with extensive long textual inputs. Extensive experiments on multiple benchmark datasets demonstrate that QA-Attack successfully deceives baseline QA models and surpasses existing adversarial techniques regarding success rate, semantics changes, BLEU score, fluency and grammar error rate.

View on arXiv PDF Code

Similar