TIACam: Text-Anchored Invariant Feature Learning with Auto-Augmentation for Camera-Robust Zero-Watermarking

Abdullah All Tanvir, Agnibh Dasgupta, Xin Zhong

arXiv:2602.18863v12.6h-index: 13

Originality Highly original

AI Analysis

This addresses the problem of robust zero-watermarking against camera distortions for applications like content authentication, with incremental improvements in feature learning and auto-augmentation.

The paper tackled the problem of camera recapture introducing optical degradations that challenge deep watermarking systems, and presented TIACam, a framework that achieved state-of-the-art feature stability and watermark extraction accuracy in experiments on synthetic and real-world captures.

Camera recapture introduces complex optical degradations, such as perspective warping, illumination shifts, and Moiré interference, that remain challenging for deep watermarking systems. We present TIACam, a text-anchored invariant feature learning framework with auto-augmentation for camera-robust zero-watermarking. The method integrates three key innovations: (1) a learnable auto-augmentor that discovers camera-like distortions through differentiable geometric, photometric, and Moiré operators; (2) a text-anchored invariant feature learner that enforces semantic consistency via cross-modal adversarial alignment between image and text; and (3) a zero-watermarking head that binds binary messages in the invariant feature space without modifying image pixels. This unified formulation jointly optimizes invariance, semantic alignment, and watermark recoverability. Extensive experiments on both synthetic and real-world camera captures demonstrate that TIACam achieves state-of-the-art feature stability and watermark extraction accuracy, establishing a principled bridge between multimodal invariance learning and physically robust zero-watermarking.

View on arXiv PDF

Similar