CVAIOct 6, 2025

ActiveMark: on watermarking of visual foundation models via massive activations

arXiv:2510.04966v1
Originality Incremental advance
AI Analysis

This addresses the need for reliable ownership verification tools to prevent illegal redistribution of VFMs, which is crucial for model owners but represents an incremental improvement in watermarking techniques.

The paper tackles the problem of intellectual property protection for visual foundation models (VFMs) by proposing a watermarking method that embeds digital watermarks into internal representations, ensuring detectability even after fine-tuning for downstream tasks, with demonstrated low false detection and misdetection probabilities.

Being trained on large and vast datasets, visual foundation models (VFMs) can be fine-tuned for diverse downstream tasks, achieving remarkable performance and efficiency in various computer vision applications. The high computation cost of data collection and training motivates the owners of some VFMs to distribute them alongside the license to protect their intellectual property rights. However, a dishonest user of the protected model's copy may illegally redistribute it, for example, to make a profit. As a consequence, the development of reliable ownership verification tools is of great importance today, since such methods can be used to differentiate between a redistributed copy of the protected model and an independent model. In this paper, we propose an approach to ownership verification of visual foundation models by fine-tuning a small set of expressive layers of a VFM along with a small encoder-decoder network to embed digital watermarks into an internal representation of a hold-out set of input images. Importantly, the watermarks embedded remain detectable in the functional copies of the protected model, obtained, for example, by fine-tuning the VFM for a particular downstream task. Theoretically and experimentally, we demonstrate that the proposed method yields a low probability of false detection of a non-watermarked model and a low probability of false misdetection of a watermarked model.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes