CR AI CL CYApr 24, 2024

Attacks on Third-Party APIs of Large Language Models

Wanru Zhao, Vidit Khazanchi, Haodi Xing, Xuanli He, Qiongkai Xu, Nicholas Donald Lane

arXiv:2404.16891v115.417 citationsh-index: 22Has Code

Originality Highly original

AI Analysis

This addresses a critical safety problem for users and developers of LLM ecosystems, highlighting vulnerabilities introduced by untrusted third-party integrations.

The paper tackles the security risks of third-party API plugins in large language model (LLM) services by proposing an attacking framework that identifies real-world malicious attacks capable of imperceptibly modifying LLM outputs across various domains.

Large language model (LLM) services have recently begun offering a plugin ecosystem to interact with third-party API services. This innovation enhances the capabilities of LLMs, but it also introduces risks, as these plugins developed by various third parties cannot be easily trusted. This paper proposes a new attacking framework to examine security and safety vulnerabilities within LLM platforms that incorporate third-party services. Applying our framework specifically to widely used LLMs, we identify real-world malicious attacks across various domains on third-party APIs that can imperceptibly modify LLM outputs. The paper discusses the unique challenges posed by third-party API integration and offers strategic possibilities to improve the security and safety of LLM ecosystems moving forward. Our code is released at https://github.com/vk0812/Third-Party-Attacks-on-LLMs.

View on arXiv PDF Code

Similar