CVAIApr 15, 2025

QAVA: Query-Agnostic Visual Attack to Large Vision-Language Models

Tsinghua
arXiv:2504.11038v115 citationsh-index: 11Has CodeNAACL
Originality Incremental advance
AI Analysis

This work addresses vulnerabilities in LVLMs for practical security applications, representing an incremental advance by extending adversarial attacks to query-agnostic scenarios.

The paper tackles the problem of adversarial attacks on large vision-language models (LVLMs) by introducing QAVA, a query-agnostic visual attack that creates robust adversarial examples to generate incorrect responses to unspecified questions, achieving performance comparable to attacks on known target questions.

In typical multimodal tasks, such as Visual Question Answering (VQA), adversarial attacks targeting a specific image and question can lead large vision-language models (LVLMs) to provide incorrect answers. However, it is common for a single image to be associated with multiple questions, and LVLMs may still answer other questions correctly even for an adversarial image attacked by a specific question. To address this, we introduce the query-agnostic visual attack (QAVA), which aims to create robust adversarial examples that generate incorrect responses to unspecified and unknown questions. Compared to traditional adversarial attacks focused on specific images and questions, QAVA significantly enhances the effectiveness and efficiency of attacks on images when the question is unknown, achieving performance comparable to attacks on known target questions. Our research broadens the scope of visual adversarial attacks on LVLMs in practical settings, uncovering previously overlooked vulnerabilities, particularly in the context of visual adversarial threats. The code is available at https://github.com/btzyd/qava.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes