CRIRLGSep 10, 2024

Adversarial Attacks to Multi-Modal Models

arXiv:2409.06793v224 citationsh-index: 11
Originality Incremental advance
AI Analysis

This addresses security vulnerabilities in multi-modal AI systems, which are increasingly used in applications like autonomous vehicles and content moderation, though it appears incremental as it builds on prior attack methods.

The paper tackles the problem of adversarial attacks on multi-modal models by introducing CrossFire, which transforms targeted inputs to match the modality of original media and optimizes perturbations to minimize embedding deviations. Experiments on six benchmark datasets show CrossFire significantly manipulates downstream tasks and surpasses existing attacks, while current defenses prove insufficient.

Multi-modal models have gained significant attention due to their powerful capabilities. These models effectively align embeddings across diverse data modalities, showcasing superior performance in downstream tasks compared to their unimodal counterparts. Recent study showed that the attacker can manipulate an image or audio file by altering it in such a way that its embedding matches that of an attacker-chosen targeted input, thereby deceiving downstream models. However, this method often underperforms due to inherent disparities in data from different modalities. In this paper, we introduce CrossFire, an innovative approach to attack multi-modal models. CrossFire begins by transforming the targeted input chosen by the attacker into a format that matches the modality of the original image or audio file. We then formulate our attack as an optimization problem, aiming to minimize the angular deviation between the embeddings of the transformed input and the modified image or audio file. Solving this problem determines the perturbations to be added to the original media. Our extensive experiments on six real-world benchmark datasets reveal that CrossFire can significantly manipulate downstream tasks, surpassing existing attacks. Additionally, we evaluate six defensive strategies against CrossFire, finding that current defenses are insufficient to counteract our CrossFire.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes