LGJan 21, 2025

BiMarker: Enhancing Text Watermark Detection for Large Language Models with Bipolar Watermarks

Zhuang Li, Qiuping Yi, Zongcheng Ji, Yijian Lu, Yanqi Li, Keyang Xiao, Hongliang Liang

arXiv:2501.12174v62 citationsh-index: 1

Originality Incremental advance

AI Analysis

This addresses watermark detection for large language models, offering an incremental improvement over existing methods like KGW.

The paper tackles the problem of distinguishing AI-generated text from human content by proposing Bipolar Watermark (BiMarker), which splits generated text into positive and negative poles to enhance detection without extra computational resources. Experimental results demonstrate its effectiveness and compatibility with existing optimization techniques.

The rapid growth of Large Language Models (LLMs) raises concerns about distinguishing AI-generated text from human content. Existing watermarking techniques, like \kgw, struggle with low watermark strength and stringent false-positive requirements. Our analysis reveals that current methods rely on coarse estimates of non-watermarked text, limiting watermark detectability. To address this, we propose Bipolar Watermark (\tool), which splits generated text into positive and negative poles, enhancing detection without requiring additional computational resources or knowledge of the prompt. Theoretical analysis and experimental results demonstrate \tool's effectiveness and compatibility with existing optimization techniques, providing a new optimization dimension for watermarking in LLM-generated content.

View on arXiv PDF

Similar