A Survey on the Safety and Security Threats of Computer-Using Agents: JARVIS or Ultron?

Peking U · Tencent
arXiv:2505.10924 · 35.823 citations · h-index: 25
Predicted impact: top 26% in CL · last 90 days · Originality: Synthesis-oriented
AI Analysis

For researchers and practitioners developing or deploying LLM-based autonomous agents, this survey systematizes emerging safety and security risks, offering a comprehensive reference.

This paper surveys safety and security threats in Computer-Using Agents (CUAs), proposing a taxonomy of threats and defenses, and summarizing benchmarks and metrics. It provides a structured foundation for future research and practical guidance for secure deployment.

Recently, AI-driven interactions with computing devices have advanced from basic prototype tools to sophisticated, LLM-based systems that emulate human-like operations in graphical user interfaces. We are now witnessing the emergence of Computer-Using Agents (CUAs), capable of autonomously performing tasks such as navigating desktop applications, web pages, and mobile apps. However, as these agents grow in capability, they also introduce novel safety and security risks. Vulnerabilities in LLM-driven reasoning, together with the added complexity of integrating multiple software components and multimodal inputs, further complicate the security landscape. In this paper, we present a systematization of knowledge on the safety and security threats of CUAs. We conduct a comprehensive literature review and distill our findings along four research objectives: (i) define the CUA in a way that suits safety analysis; (ii) categorize current safety threats among CUAs; (iii) propose a comprehensive taxonomy of existing defensive strategies; (iv) summarize prevailing benchmarks, datasets, and evaluation metrics used to assess the safety and performance of CUAs. Building on these insights, our work provides future researchers with a structured foundation for investigating unexplored vulnerabilities and offers practitioners actionable guidance in designing and deploying secure Computer-Using Agents.
