CRCLLGJan 28, 2025

Contextual Agent Security: A Policy for Every Purpose

arXiv:2501.17070v336 citationsh-index: 3HotOS
Originality Incremental advance
AI Analysis

This work addresses security challenges for generalist agents, which is an incremental step in adapting security to broader AI capabilities.

The paper tackles the problem of ensuring safety for generalist agents by proposing a framework called Conseca that generates just-in-time, contextual, and human-verifiable security policies, addressing the need for adaptable security designs as agents operate in diverse contexts.

Judging an action's safety requires knowledge of the context in which the action takes place. To human agents who act in various contexts, this may seem obvious: performing an action such as email deletion may or may not be appropriate depending on the email's content, the goal (e.g., to erase sensitive emails or to clean up trash), and the type of email address (e.g., work or personal). Unlike people, computational systems have often had only limited agency in limited contexts. Thus, manually crafted policies and user confirmation (e.g., smartphone app permissions or network access control lists), while imperfect, have sufficed to restrict harmful actions. However, with the upcoming deployment of generalist agents that support a multitude of tasks (e.g., an automated personal assistant), we argue that we must rethink security designs to adapt to the scale of contexts and capabilities of these systems. As a first step, this paper explores contextual security in the domain of agents and proposes contextual agent security (Conseca), a framework to generate just-in-time, contextual, and human-verifiable security policies.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes