LG CRFeb 4, 2025

Adversarial ML Problems Are Getting Harder to Solve and to Evaluate

Javier Rando, Jie Zhang, Nicholas Carlini, Florian Tramèr

ETH Zurich

arXiv:2502.02260v127.326 citationsh-index: 35

Originality Synthesis-oriented

AI Analysis

This is an incremental position paper highlighting evaluation challenges in adversarial ML for researchers.

The paper argues that adversarial ML problems, especially for large language models, are becoming less clearly defined, harder to solve, and more challenging to evaluate, potentially leading to another decade of stalled progress.

In the past decade, considerable research effort has been devoted to securing machine learning (ML) models that operate in adversarial settings. Yet, progress has been slow even for simple "toy" problems (e.g., robustness to small adversarial perturbations) and is often hindered by non-rigorous evaluations. Today, adversarial ML research has shifted towards studying larger, general-purpose language models. In this position paper, we argue that the situation is now even worse: in the era of LLMs, the field of adversarial ML studies problems that are (1) less clearly defined, (2) harder to solve, and (3) even more challenging to evaluate. As a result, we caution that yet another decade of work on adversarial ML may fail to produce meaningful progress.

View on arXiv PDF

Similar