AICRDec 20, 2023

The Adaptive Arms Race: Redefining Robustness in AI Security

arXiv:2312.13435v33 citationsh-index: 29RAID
AI Analysis

This work addresses the vulnerability of black-box AI-based systems to adaptive adversaries, which is a critical security issue for deployed ML applications, though it builds incrementally on existing robustness evaluation methods.

The paper tackles the problem of evaluating and enhancing robustness in AI systems against decision-based attacks by broadening the concept of adaptivity to improve both attacks and defenses through mutual learning. It introduces a framework that outperforms state-of-the-art black-box attacks and defenses, providing insights into real-world ML system robustness.

Despite considerable efforts on making them robust, real-world AI-based systems remain vulnerable to decision based attacks, as definitive proofs of their operational robustness have so far proven intractable. Canonical robustness evaluation relies on adaptive attacks, which leverage complete knowledge of the defense and are tailored to bypass it. This work broadens the notion of adaptivity, which we employ to enhance both attacks and defenses, showing how they can benefit from mutual learning through interaction. We introduce a framework for adaptively optimizing black-box attacks and defenses under the competitive game they form. To assess robustness reliably, it is essential to evaluate against realistic and worst-case attacks. We thus enhance attacks and their evasive arsenal together using RL, apply the same principle to defenses, and evaluate them first independently and then jointly under a multi-agent perspective. We find that active defenses, those that dynamically control system responses, are an essential complement to model hardening against decision-based attacks; that these defenses can be circumvented by adaptive attacks, something that elicits defenses being adaptive too. Our findings, supported by an extensive theoretical and empirical investigation, confirm that adaptive adversaries pose a serious threat to black-box AI-based systems, rekindling the proverbial arms race. Notably, our approach outperforms the state-of-the-art black-box attacks and defenses, while bringing them together to render effective insights into the robustness of real-world deployed ML-based systems.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes