AIDec 4, 2024

Robust Multi-bit Text Watermark with LLM-based Paraphrasers

arXiv:2412.03123v29 citationsh-index: 9Has CodeICML
Originality Highly original
AI Analysis

This work addresses the need for robust and stealthy text watermarking for content authentication and copyright protection, representing a novel method rather than an incremental improvement.

The authors tackled the problem of embedding imperceptible multi-bit watermarks in text using LLM-based paraphrasers, achieving over 99.99% detection AUC while preserving semantic information and demonstrating robustness to perturbations and out-of-distribution data.

We propose an imperceptible multi-bit text watermark embedded by paraphrasing with LLMs. We fine-tune a pair of LLM paraphrasers that are designed to behave differently so that their paraphrasing difference reflected in the text semantics can be identified by a trained decoder. To embed our multi-bit watermark, we use two paraphrasers alternatively to encode the pre-defined binary code at the sentence level. Then we use a text classifier as the decoder to decode each bit of the watermark. Through extensive experiments, we show that our watermarks can achieve over 99.99\% detection AUC with small (1.1B) text paraphrasers while keeping the semantic information of the original sentence. More importantly, our pipeline is robust under word substitution and sentence paraphrasing perturbations and generalizes well to out-of-distributional data. We also show the stealthiness of our watermark with LLM-based evaluation. We open-source the code: https://github.com/xiaojunxu/multi-bit-text-watermark.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes