CRAIJan 5

AgentMark: Utility-Preserving Behavioral Watermarking for Agents

arXiv:2601.03294v11 citationsh-index: 4Has Code
Originality Incremental advance
AI Analysis

This addresses the need for IP protection and provenance in autonomous agents, offering a novel solution for black-box agent systems, though it is incremental in extending watermarking from content to behavior.

The paper tackles the problem of protecting intellectual property and ensuring regulatory provenance for LLM-based agents by proposing AgentMark, a behavioral watermarking framework that embeds multi-bit identifiers into planning decisions while preserving utility, achieving practical multi-bit capacity and robust recovery from partial logs in experiments.

LLM-based agents are increasingly deployed to autonomously solve complex tasks, raising urgent needs for IP protection and regulatory provenance. While content watermarking effectively attributes LLM-generated outputs, it fails to directly identify the high-level planning behaviors (e.g., tool and subgoal choices) that govern multi-step execution. Critically, watermarking at the planning-behavior layer faces unique challenges: minor distributional deviations in decision-making can compound during long-term agent operation, degrading utility, and many agents operate as black boxes that are difficult to intervene in directly. To bridge this gap, we propose AgentMark, a behavioral watermarking framework that embeds multi-bit identifiers into planning decisions while preserving utility. It operates by eliciting an explicit behavior distribution from the agent and applying distribution-preserving conditional sampling, enabling deployment under black-box APIs while remaining compatible with action-layer content watermarking. Experiments across embodied, tool-use, and social environments demonstrate practical multi-bit capacity, robust recovery from partial logs, and utility preservation. The code is available at https://github.com/Tooooa/AgentMark.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes