CVAug 19, 2025

Backdooring Self-Supervised Contrastive Learning by Noisy Alignment

arXiv:2508.14015v14 citationsh-index: 3Has Code
Originality Incremental advance
AI Analysis

This addresses a security problem for machine learning practitioners using contrastive learning, but it is incremental as it builds on existing backdoor attack methods.

The paper tackles the vulnerability of self-supervised contrastive learning to data poisoning backdoor attacks by proposing Noisy Alignment, a method that suppresses noise components in poisoned images, achieving state-of-the-art performance while maintaining clean-data accuracy.

Self-supervised contrastive learning (CL) effectively learns transferable representations from unlabeled data containing images or image-text pairs but suffers vulnerability to data poisoning backdoor attacks (DPCLs). An adversary can inject poisoned images into pretraining datasets, causing compromised CL encoders to exhibit targeted misbehavior in downstream tasks. Existing DPCLs, however, achieve limited efficacy due to their dependence on fragile implicit co-occurrence between backdoor and target object and inadequate suppression of discriminative features in backdoored images. We propose Noisy Alignment (NA), a DPCL method that explicitly suppresses noise components in poisoned images. Inspired by powerful training-controllable CL attacks, we identify and extract the critical objective of noisy alignment, adapting it effectively into data-poisoning scenarios. Our method implements noisy alignment by strategically manipulating contrastive learning's random cropping mechanism, formulating this process as an image layout optimization problem with theoretically derived optimal parameters. The resulting method is simple yet effective, achieving state-of-the-art performance compared to existing DPCLs, while maintaining clean-data accuracy. Furthermore, Noisy Alignment demonstrates robustness against common backdoor defenses. Codes can be found at https://github.com/jsrdcht/Noisy-Alignment.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes