CVAug 8, 2025

Towards Robust Red-Green Watermarking for Autoregressive Image Generators

arXiv:2508.06656v12 citationsh-index: 13
Originality Incremental advance
AI Analysis

This work addresses the need for robust watermarking in autoregressive image models, which is an incremental advancement over existing methods for latent diffusion models.

The paper tackled the problem of watermarking autoregressive image generators to detect and attribute generated content, proposing two novel methods that improve robustness against perturbations and regeneration attacks while preserving image quality, with experiments showing enhanced detectability and fast verification runtime.

In-generation watermarking for detecting and attributing generated content has recently been explored for latent diffusion models (LDMs), demonstrating high robustness. However, the use of in-generation watermarks in autoregressive (AR) image models has not been explored yet. AR models generate images by autoregressively predicting a sequence of visual tokens that are then decoded into pixels using a vector-quantized decoder. Inspired by red-green watermarks for large language models, we examine token-level watermarking schemes that bias the next-token prediction based on prior tokens. We find that a direct transfer of these schemes works in principle, but the detectability of the watermarks decreases considerably under common image perturbations. As a remedy, we propose two novel watermarking methods that rely on visual token clustering to assign similar tokens to the same set. Firstly, we investigate a training-free approach that relies on a cluster lookup table, and secondly, we finetune VAE encoders to predict token clusters directly from perturbed images. Overall, our experiments show that cluster-level watermarks improve robustness against perturbations and regeneration attacks while preserving image quality. Cluster classification further boosts watermark detectability, outperforming a set of baselines. Moreover, our methods offer fast verification runtime, comparable to lightweight post-hoc watermarking methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes