CVSep 24, 2025

FreezeVLA: Action-Freezing Attacks against Vision-Language-Action Models

arXiv:2509.19870v18 citationsh-index: 57
Originality Incremental advance
AI Analysis

This exposes a critical safety risk in robotics, potentially causing inaction during critical interventions, and is incremental as it formalizes and exploits an underexplored vulnerability.

The paper tackles the vulnerability of Vision-Language-Action (VLA) models to adversarial attacks that cause them to ignore instructions, achieving an average attack success rate of 76.2% across three models and four benchmarks.

Vision-Language-Action (VLA) models are driving rapid progress in robotics by enabling agents to interpret multimodal inputs and execute complex, long-horizon tasks. However, their safety and robustness against adversarial attacks remain largely underexplored. In this work, we identify and formalize a critical adversarial vulnerability in which adversarial images can "freeze" VLA models and cause them to ignore subsequent instructions. This threat effectively disconnects the robot's digital mind from its physical actions, potentially inducing inaction during critical interventions. To systematically study this vulnerability, we propose FreezeVLA, a novel attack framework that generates and evaluates action-freezing attacks via min-max bi-level optimization. Experiments on three state-of-the-art VLA models and four robotic benchmarks show that FreezeVLA attains an average attack success rate of 76.2%, significantly outperforming existing methods. Moreover, adversarial images generated by FreezeVLA exhibit strong transferability, with a single image reliably inducing paralysis across diverse language prompts. Our findings expose a critical safety risk in VLA models and highlight the urgent need for robust defense mechanisms.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes