CV AINov 14, 2025

AUVIC: Adversarial Unlearning of Visual Concepts for Multi-modal Large Language Models

Haokun Chen, Jianing Li, Yao Zhang, Jinhe Bi, Yan Xia, Jindong Gu, Volker Tresp

arXiv:2511.11299v16.22 citationsh-index: 21

Originality Incremental advance

AI Analysis

This addresses data privacy concerns for MLLM users by enabling precise visual concept unlearning, though it is incremental as it builds on existing unlearning techniques.

The paper tackles the problem of removing sensitive or copyrighted visual concepts from multimodal large language models to comply with data privacy regulations, introducing AUVIC, which achieves state-of-the-art forgetting rates with minimal performance degradation on non-target concepts.

Multimodal Large Language Models (MLLMs) achieve impressive performance once optimized on massive datasets. Such datasets often contain sensitive or copyrighted content, raising significant data privacy concerns. Regulatory frameworks mandating the 'right to be forgotten' drive the need for machine unlearning. This technique allows for the removal of target data without resource-consuming retraining. However, while well-studied for text, visual concept unlearning in MLLMs remains underexplored. A primary challenge is precisely removing a target visual concept without disrupting model performance on related entities. To address this, we introduce AUVIC, a novel visual concept unlearning framework for MLLMs. AUVIC applies adversarial perturbations to enable precise forgetting. This approach effectively isolates the target concept while avoiding unintended effects on similar entities. To evaluate our method, we construct VCUBench. It is the first benchmark designed to assess visual concept unlearning in group contexts. Experimental results demonstrate that AUVIC achieves state-of-the-art target forgetting rates while incurs minimal performance degradation on non-target concepts.

View on arXiv PDF

Similar