SEDec 19, 2020

A Declarative Metamorphic Testing Framework for Autonomous Driving

arXiv:2012.10672v457 citations
AI Analysis

This work provides a more flexible and comprehensive testing framework for autonomous driving models, improving the safety and reliability for developers and users of autonomous vehicles.

This paper addresses the limitations of existing metamorphic testing for autonomous driving models, which are restricted to a small set of metamorphic relations. The authors propose RMT, a declarative rule-based framework that allows users to specify an enriched set of testing scenarios using natural language, detecting a significant number of abnormal model predictions missed by prior work.

Autonomous driving has gained much attention from both industry and academia. Currently, Deep Neural Networks (DNNs) are widely used for perception and control in autonomous driving. However, several fatal accidents caused by autonomous vehicles have raised serious safety concerns about autonomous driving models. Some recent studies have successfully used the metamorphic testing technique to detect thousands of potential issues in some popularly used autonomous driving models. However, prior study is limited to a small set of metamorphic relations, which do not reflect rich, real-world traffic scenarios and are also not customizable. This paper presents a novel declarative rule-based metamorphic testing framework called RMT. RMT provides a rule template with natural language syntax, allowing users to flexibly specify an enriched set of testing scenarios based on real-world traffic rules and domain knowledge. RMT automatically parses human-written rules to metamorphic relations using an NLP-based rule parser referring to an ontology list and generates test cases with a variety of image transformation engines. We evaluated RMT on three autonomous driving models. With an enriched set of metamorphic relations, RMT detected a significant number of abnormal model predictions that were not detected by prior work. Through a large-scale human study on Amazon Mechanical Turk, we further confirmed the authenticity of test cases generated by RMT and the validity of detected abnormal model predictions.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes