LGCVJun 30, 2025

Radioactive Watermarks in Diffusion and Autoregressive Image Generative Models

arXiv:2506.23731v15 citationsh-index: 12
Originality Incremental advance
AI Analysis

This addresses the problem of protecting intellectual property in generative AI for model developers, though it is incremental as it builds on existing watermarking concepts.

The paper tackles the problem of unauthorized use of generated images from diffusion and autoregressive models for training new models, finding that existing watermarking methods fail to retain radioactivity (persistence through training). It proposes the first radioactive watermarking method for image autoregressive models, showing effectiveness in preserving radioactivity and enabling robust provenance tracking.

Image generative models have become increasingly popular, but training them requires large datasets that are costly to collect and curate. To circumvent these costs, some parties may exploit existing models by using the generated images as training data for their own models. In general, watermarking is a valuable tool for detecting unauthorized use of generated images. However, when these images are used to train a new model, watermarking can only enable detection if the watermark persists through training and remains identifiable in the outputs of the newly trained model - a property known as radioactivity. We analyze the radioactivity of watermarks in images generated by diffusion models (DMs) and image autoregressive models (IARs). We find that existing watermarking methods for DMs fail to retain radioactivity, as watermarks are either erased during encoding into the latent space or lost in the noising-denoising process (during the training in the latent space). Meanwhile, despite IARs having recently surpassed DMs in image generation quality and efficiency, no radioactive watermarking methods have been proposed for them. To overcome this limitation, we propose the first watermarking method tailored for IARs and with radioactivity in mind - drawing inspiration from techniques in large language models (LLMs), which share IARs' autoregressive paradigm. Our extensive experimental evaluation highlights our method's effectiveness in preserving radioactivity within IARs, enabling robust provenance tracking, and preventing unauthorized use of their generated images.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes