MMCVIVMar 14, 2025

Safe-VAR: Safe Visual Autoregressive Model for Text-to-Image Generative Watermarking

arXiv:2503.11324v13 citationsh-index: 3
Originality Highly original
AI Analysis

This work addresses misuse prevention in text-to-image generation for users of autoregressive models, representing a novel method for a known bottleneck.

The paper tackles the problem of invisible watermarking for autoregressive text-to-image models, which is underexplored, and proposes Safe-VAR, a framework that achieves state-of-the-art performance in image quality, watermark fidelity, and robustness, with strong generalization to out-of-domain datasets like QR Codes.

With the success of autoregressive learning in large language models, it has become a dominant approach for text-to-image generation, offering high efficiency and visual quality. However, invisible watermarking for visual autoregressive (VAR) models remains underexplored, despite its importance in misuse prevention. Existing watermarking methods, designed for diffusion models, often struggle to adapt to the sequential nature of VAR models. To bridge this gap, we propose Safe-VAR, the first watermarking framework specifically designed for autoregressive text-to-image generation. Our study reveals that the timing of watermark injection significantly impacts generation quality, and watermarks of different complexities exhibit varying optimal injection times. Motivated by this observation, we propose an Adaptive Scale Interaction Module, which dynamically determines the optimal watermark embedding strategy based on the watermark information and the visual characteristics of the generated image. This ensures watermark robustness while minimizing its impact on image quality. Furthermore, we introduce a Cross-Scale Fusion mechanism, which integrates mixture of both heads and experts to effectively fuse multi-resolution features and handle complex interactions between image content and watermark patterns. Experimental results demonstrate that Safe-VAR achieves state-of-the-art performance, significantly surpassing existing counterparts regarding image quality, watermarking fidelity, and robustness against perturbations. Moreover, our method exhibits strong generalization to an out-of-domain watermark dataset QR Codes.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes