CRAIMay 3

Repurposing and Evaluating the (In)Feasibility of Dataset Poisoning enabled Watermarking for Contrastive Learning

arXiv:2605.0183456.2
Predicted impact top 34% in CR · last 90 daysOriginality Incremental advance
AI Analysis

For practitioners using contrastive learning with third-party data, this work provides a method to protect dataset intellectual property by repurposing weak backdoor attacks as watermarks.

The paper systematically evaluates data-poisoning backdoor attacks on contrastive learning, revealing limitations such as poor dataset adaptability and low success rates. It repurposes these weak backdoor effects as a watermark for dataset IP protection, achieving effective verification via a unified density metric and a multi-level watermarking scheme.

Contrastive learning (CL) reduces annotation cost via auto-derived supervisory signals. Since large-scale in-house CL datasets are infeasible, reliance on third-party or internet data is common. Recent studies show CL models are vulnerable to data-poisoning backdoor attacks, but their generalization and robustness are underexplored. We systematically evaluate existing data-poisoning backdoor attacks on CL, revealing limitations: poor dataset adaptability, low success rates, limited portability, and restrictive assumptions (e.g., downstream task knowledge). Interestingly, trigger samples exhibit distinguishable statistical divergence from clean samples, which inspires repurposing it as a watermark for dataset IP protection. Direct repurposing is challenging due to low success rates; we overcome this by statistical verification using a unified density metric. We further propose a multi-level watermarking scheme adapting to feature-level, soft-label, or hard-label outputs in CL. Experiments show some backdoor attacks can be repurposed as effective watermarks with trade-offs among fidelity, verifiability, and robustness. This work demonstrates weak backdoor effects become reliable signals for dataset IP protection in challenging CL settings.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes