CVMar 11

SERUM: Simple, Efficient, Robust, and Unifying Marking for Diffusion-based Image Generation

arXiv:2603.1339648.21 citationsh-index: 8
AI Analysis

This provides a practical solution for reliably marking and detecting generated images from diffusion models, addressing a need in media authentication and content moderation.

The paper tackles the problem of marking images generated by diffusion models to distinguish them from natural images, proposing SERUM, which adds a unique watermark noise to the initial diffusion noise and trains a lightweight detector, achieving the highest true positive rate at a 1% false positive rate in most scenarios.

We propose SERUM: an intriguingly simple yet highly effective method for marking images generated by diffusion models (DMs). We only add a unique watermark noise to the initial diffusion generation noise and train a lightweight detector to identify watermarked images, simplifying and unifying the strengths of prior approaches. SERUM provides robustness against any image augmentations or watermark removal attacks and is extremely efficient, all while maintaining negligible impact on image quality. In contrast to prior approaches, which are often only resilient to limited perturbations and incur significant training, injection, and detection costs, our SERUM achieves remarkable performance, with the highest true positive rate (TPR) at a 1% false positive rate (FPR) in most scenarios, along with fast injection and detection and low detector training overhead. Its decoupled architecture also seamlessly supports multiple users by embedding individualized watermarks with little interference between the marks. Overall, our method provides a practical solution to mark outputs from DMs and to reliably distinguish generated from natural images.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes