CRLGFeb 8, 2024

Buffer Overflow in Mixture of Experts

DeepMind
arXiv:2402.05526v113 citationsh-index: 26
Originality Synthesis-oriented
AI Analysis

This exposes a security flaw in widely used MoE systems for scaling large foundation models, potentially compromising inference integrity.

The paper identifies a vulnerability in Mixture of Experts (MoE) models where expert routing strategies with cross-batch dependencies can be exploited by malicious queries to affect outputs on benign queries in the same batch, demonstrated through a proof-of-concept attack in a toy setting.

Mixture of Experts (MoE) has become a key ingredient for scaling large foundation models while keeping inference costs steady. We show that expert routing strategies that have cross-batch dependencies are vulnerable to attacks. Malicious queries can be sent to a model and can affect a model's output on other benign queries if they are grouped in the same batch. We demonstrate this via a proof-of-concept attack in a toy experimental setting.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes