CRAICLCYApr 24, 2024

Attacks on Third-Party APIs of Large Language Models

arXiv:2404.16891v10.1517 citationsh-index: 22Has Code
AI Analysis80

This addresses a critical safety problem for users and developers of LLM ecosystems, highlighting vulnerabilities introduced by untrusted third-party integrations.

The paper tackles the security risks of third-party API plugins in large language model (LLM) services by proposing an attacking framework that identifies real-world malicious attacks capable of imperceptibly modifying LLM outputs across various domains.

Large language model (LLM) services have recently begun offering a plugin ecosystem to interact with third-party API services. This innovation enhances the capabilities of LLMs, but it also introduces risks, as these plugins developed by various third parties cannot be easily trusted. This paper proposes a new attacking framework to examine security and safety vulnerabilities within LLM platforms that incorporate third-party services. Applying our framework specifically to widely used LLMs, we identify real-world malicious attacks across various domains on third-party APIs that can imperceptibly modify LLM outputs. The paper discusses the unique challenges posed by third-party API integration and offers strategic possibilities to improve the security and safety of LLM ecosystems moving forward. Our code is released at https://github.com/vk0812/Third-Party-Attacks-on-LLMs.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes