CRAICLDec 28, 2023

Optimizing watermarks for large language models

arXiv:2312.17295v127 citationsh-index: 1ICML
Originality Incremental advance
AI Analysis

This work addresses the problem of misuse detection in LLMs for users needing reliable watermarking, but it appears incremental as it optimizes existing watermark classes.

The paper tackles the trade-off between identifiability and text quality in watermarks for large language models by formulating it as a multi-objective optimization problem, and shows that the identified Pareto optimal solutions outperform the default watermark.

With the rise of large language models (LLMs) and concerns about potential misuse, watermarks for generative LLMs have recently attracted much attention. An important aspect of such watermarks is the trade-off between their identifiability and their impact on the quality of the generated text. This paper introduces a systematic approach to this trade-off in terms of a multi-objective optimization problem. For a large class of robust, efficient watermarks, the associated Pareto optimal solutions are identified and shown to outperform the currently default watermark.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes