SELGFeb 26, 2021

Distribution-Aware Testing of Neural Networks Using Generative Models

arXiv:2102.13602v166 citations
Originality Incremental advance
AI Analysis

This addresses reliability issues in critical DNN applications by improving testing validity, though it is incremental as it builds on existing testing frameworks.

The paper tackles the problem of invalid test inputs generated by existing DNN testing techniques, showing that three recent methods produce significant numbers of invalid inputs, which falsely inflate test coverage metrics. It proposes a deep generative model-based technique to generate only valid inputs, effectively eliminating invalid tests and boosting valid test generation.

The reliability of software that has a Deep Neural Network (DNN) as a component is urgently important today given the increasing number of critical applications being deployed with DNNs. The need for reliability raises a need for rigorous testing of the safety and trustworthiness of these systems. In the last few years, there have been a number of research efforts focused on testing DNNs. However the test generation techniques proposed so far lack a check to determine whether the test inputs they are generating are valid, and thus invalid inputs are produced. To illustrate this situation, we explored three recent DNN testing techniques. Using deep generative model based input validation, we show that all the three techniques generate significant number of invalid test inputs. We further analyzed the test coverage achieved by the test inputs generated by the DNN testing techniques and showed how invalid test inputs can falsely inflate test coverage metrics. To overcome the inclusion of invalid inputs in testing, we propose a technique to incorporate the valid input space of the DNN model under test in the test generation process. Our technique uses a deep generative model-based algorithm to generate only valid inputs. Results of our empirical studies show that our technique is effective in eliminating invalid tests and boosting the number of valid test inputs generated.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes