ROAIMay 20, 2025

Safety2Drive: Safety-Critical Scenario Benchmark for the Evaluation of Autonomous Driving

arXiv:2505.13872v11 citationsh-index: 7Has Code
Originality Incremental advance
AI Analysis

This addresses the problem of inadequate safety validation for autonomous driving systems, which poses risks to practical deployment, though it is incremental as it builds on existing benchmarks.

The authors tackled the lack of regulatory-compliant and safety-critical scenario libraries for evaluating autonomous driving systems by proposing Safety2Drive, a benchmark that includes 70 test items and supports scenario generalization with threats like natural corruptions and adversarial attacks.

Autonomous Driving (AD) systems demand the high levels of safety assurance. Despite significant advancements in AD demonstrated on open-source benchmarks like Longest6 and Bench2Drive, existing datasets still lack regulatory-compliant scenario libraries for closed-loop testing to comprehensively evaluate the functional safety of AD. Meanwhile, real-world AD accidents are underrepresented in current driving datasets. This scarcity leads to inadequate evaluation of AD performance, posing risks to safety validation and practical deployment. To address these challenges, we propose Safety2Drive, a safety-critical scenario library designed to evaluate AD systems. Safety2Drive offers three key contributions. (1) Safety2Drive comprehensively covers the test items required by standard regulations and contains 70 AD function test items. (2) Safety2Drive supports the safety-critical scenario generalization. It has the ability to inject safety threats such as natural environment corruptions and adversarial attacks cross camera and LiDAR sensors. (3) Safety2Drive supports multi-dimensional evaluation. In addition to the evaluation of AD systems, it also supports the evaluation of various perception tasks, such as object detection and lane detection. Safety2Drive provides a paradigm from scenario construction to validation, establishing a standardized test framework for the safe deployment of AD.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes