CV | Dec 16, 2025

Erasing CLIP Memories: Non-Destructive, Data-Free Zero-Shot Class Unlearning in CLIP Models

Ashish Mishra, Tarun Kumar, Gyanaranjan Nayak, Arpit Shah, Suparna Bhattacharya, Martin Foltin

arXiv:2512.14137v1 | 3.6 | h-index: 4

Originality: Incremental advance

AI Analysis

This work addresses model decontamination and privacy preservation for users of multimodal AI systems. It is an incremental advance, since it builds on existing unlearning and nullspace projection techniques.

The paper tackles the problem of selectively erasing specific class information from pretrained CLIP models without retraining or using original data, achieving a pronounced drop in zero-shot performance for target classes while preserving overall model knowledge.

We introduce a novel, closed-form approach for selective unlearning in multimodal models, specifically targeting pretrained models such as CLIP. Our method leverages nullspace projection to erase the target class information embedded in the final projection layer, without requiring any retraining or the use of images from the forget set. By computing an orthonormal basis for the subspace spanned by the target text embeddings and projecting these directions out, we dramatically reduce the alignment between image features and undesired classes. Unlike traditional unlearning techniques that rely on iterative fine-tuning and extensive data curation, our approach is both computationally efficient and surgically precise. This leads to a pronounced drop in zero-shot performance for the target classes while preserving the overall multimodal knowledge of the model. Our experiments demonstrate that even a partial projection can strike a balance between complete unlearning and the retention of useful information, addressing key challenges in model decontamination and privacy preservation.
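The projection step described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. The function names `forget_projector` and `apply_to_projection`, the `alpha` parameter used to model a "partial projection", and the choice to apply the projector to an image-side projection matrix `W` (with features computed as `x @ W`) are all assumptions made for illustration.

```python
import numpy as np


def forget_projector(text_embeds, alpha=1.0):
    """Build P = I - alpha * U U^T (hypothetical helper, not from the paper).

    text_embeds: (n_classes, d) array of text embeddings for the classes to forget.
    alpha: assumed strength knob; 1.0 removes the subspace entirely,
           values in (0, 1) only shrink it (a "partial projection").
    """
    T = np.asarray(text_embeds, dtype=np.float64)
    # The right singular vectors with non-negligible singular values form an
    # orthonormal basis for the subspace spanned by the rows of T.
    _, s, vt = np.linalg.svd(T, full_matrices=False)
    tol = s.max() * max(T.shape) * np.finfo(T.dtype).eps
    U = vt[s > tol].T  # shape (d, r), orthonormal columns
    return np.eye(T.shape[1]) - alpha * (U @ U.T)


def apply_to_projection(W, P):
    """Fold the projector into a final projection matrix W of shape (d_in, d).

    Assumes features are computed as x @ W, so the edited layer is W @ P.
    """
    return np.asarray(W, dtype=np.float64) @ P


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    d_in, d = 16, 8
    W = rng.normal(size=(d_in, d))        # stand-in for a final projection layer
    forget_text = rng.normal(size=(2, d))  # stand-in text embeddings to forget
    x = rng.normal(size=(d_in,))           # stand-in pre-projection image feature

    W_full = apply_to_projection(W, forget_projector(forget_text))
    W_half = apply_to_projection(W, forget_projector(forget_text, alpha=0.5))

    print("original scores:", forget_text @ (x @ W))
    print("full projection:", forget_text @ (x @ W_full))
    print("half projection:", forget_text @ (x @ W_half))
```

Because the edit is a single matrix product, no images from the forget set and no gradient steps are needed, which matches the closed-form, data-free framing above. With `alpha=1.0` the edited features have zero inner product with every forgotten text embedding, and with `alpha=0.5` those inner products are halved.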
