CR CL LGMay 12

TextSeal: A Localized LLM Watermark for Provenance & Distillation Protection

Tom Sander, Hongyan Chang, Tomáš Souček, Tuan Tran, Valeriu Lacatusu, Sylvestre-Alvise Rebuffi, Alexandre Mourachko, Surya Parimi, Christophe Ropers, Rashel Moritz, Vanessa Stark, Hady Elsahar

arXiv:2605.1245694.7

Predicted impact top 1% in CR · last 90 daysOriginality Incremental advance

AI Analysis

For LLM developers and deployers, TextSeal provides a practical, distortion-free watermarking method that is robust to dilution and supports serving optimizations, addressing the need for provenance and distillation protection.

TextSeal introduces a localized LLM watermark that enables provenance detection and unauthorized distillation detection without inference overhead, outperforming SynthID-text in detection strength and maintaining performance across reasoning benchmarks and multilingual human evaluation.

We introduce TextSeal, a state-of-the-art watermark for large language models. Building on Gumbel-max sampling, TextSeal introduces dual-key generation to restore output diversity, along with entropy-weighted scoring and multi-region localization for improved detection. It supports serving optimizations such as speculative decoding and multi-token prediction, and does not add any inference overhead. TextSeal strictly dominates baselines like SynthID-text in detection strength and is robust to dilution, maintaining confident localized detection even in heavily mixed human/AI documents. The scheme is theoretically distortion-free, and evaluation across reasoning benchmarks confirms that it preserves downstream performance; while a multilingual human evaluation (6000 A/B comparisons, 5 languages) shows no perceptible quality difference. Beyond its use for provenance detection, TextSeal is also ``radioactive'': its watermark signal transfers through model distillation, enabling detection of unauthorized use.

View on arXiv PDF

Similar