CVAICRLGMar 5, 2025

Task-Agnostic Attacks Against Vision Foundation Models

arXiv:2503.03842v11 citationsh-index: 7Has Code2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)
Originality Highly original
AI Analysis

This addresses a critical security gap for practitioners using pre-trained vision models across various applications, as attacks on foundation models remain largely unexplored.

The paper tackles the problem of security vulnerabilities in vision foundation models by proposing a task-agnostic adversarial attack framework that disrupts feature representations, and it extensively evaluates the attack's impact and transferability across multiple downstream tasks.

The study of security in machine learning mainly focuses on downstream task-specific attacks, where the adversarial example is obtained by optimizing a loss function specific to the downstream task. At the same time, it has become standard practice for machine learning practitioners to adopt publicly available pre-trained vision foundation models, effectively sharing a common backbone architecture across a multitude of applications such as classification, segmentation, depth estimation, retrieval, question-answering and more. The study of attacks on such foundation models and their impact to multiple downstream tasks remains vastly unexplored. This work proposes a general framework that forges task-agnostic adversarial examples by maximally disrupting the feature representation obtained with foundation models. We extensively evaluate the security of the feature representations obtained by popular vision foundation models by measuring the impact of this attack on multiple downstream tasks and its transferability between models.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes