RO AIMay 23

Silent Failures in Physical AI: A Literature Review of Runtime Action Authorization for Autonomous Systems

arXiv:2606.000907.5h-index: 1

Predicted impact top 62% in RO · last 90 daysOriginality Synthesis-oriented

AI Analysis

For researchers and developers of autonomous systems (robots, vehicles, drones), this review highlights the need for runtime authorization mechanisms to prevent silent failures, but it is a literature review with no experimental results.

This literature review identifies a critical safety gap in Physical AI systems: black-box models can issue physically consequential actions that appear correct but are actually unsafe due to silent failures (e.g., sensor drift, distribution shift). The authors synthesize existing work across multiple fields and propose a bounded problem formulation, a definition of silent physical-action failure, a taxonomy of runtime guardrail functions, and evaluation requirements for comparing guardrails.

Physical AI systems increasingly map multimodal observations, language instructions, and learned world representations into physically consequential actions. Robotics foundation models, vision-language-action models, and world-model-based autonomous systems can condition decisions that move vehicles, robots, drones, and industrial machines. This transition exposes a safety problem that is not fully captured by conventional AI content moderation or by classical robot safety alone: a black-box model may issue a physically consequential action while appearing confident, plausible, and semantically aligned. The resulting failure can be silent, arising from sensor drift, occlusion, state-estimation error, distribution shift, hallucinated affordances, or invalid physical assumptions before downstream hardware controllers detect a violation. Across embodied foundation models, world models, robotics simulation, embodied safety benchmarks, safe control, runtime assurance, uncertainty estimation, verification, and guardrail evaluation, model capability and safety mechanisms have advanced along largely separate technical tracks. A recurring gap synthesized here is that no single stream surveyed in this review supplies a complete runtime authorization boundary between black-box Physical AI models and physical execution. The resulting analysis develops a bounded problem formulation, a definition of silent physical-action failure, a taxonomy of runtime guardrail functions, and evaluation requirements for comparing guardrails as Physical AI assurance mechanisms.

View on arXiv PDF

Similar