SEAICRLGSep 4, 2018

DeepHunter: Hunting Deep Neural Network Defects via Coverage-Guided Fuzzing

arXiv:1809.01266v342 citations
Originality Incremental advance
AI Analysis

It addresses safety-critical issues in DNN-based applications like autonomous driving by improving model quality evaluation, though it is incremental as it builds on existing fuzzing techniques.

The paper tackles the problem of hidden defects in deep neural networks (DNNs) by proposing DeepHunter, an automated fuzz testing framework that uses coverage-guided fuzzing to generate tests, resulting in significant coverage boosts and detection of erroneous behaviors across multiple datasets and DNNs.

In company with the data explosion over the past decade, deep neural network (DNN) based software has experienced unprecedented leap and is becoming the key driving force of many novel industrial applications, including many safety-critical scenarios such as autonomous driving. Despite great success achieved in various human intelligence tasks, similar to traditional software, DNNs could also exhibit incorrect behaviors caused by hidden defects causing severe accidents and losses. In this paper, we propose DeepHunter, an automated fuzz testing framework for hunting potential defects of general-purpose DNNs. DeepHunter performs metamorphic mutation to generate new semantically preserved tests, and leverages multiple plugable coverage criteria as feedback to guide the test generation from different perspectives. To be scalable towards practical-sized DNNs, DeepHunter maintains multiple tests in a batch, and prioritizes the tests selection based on active feedback. The effectiveness of DeepHunter is extensively investigated on 3 popular datasets (MNIST, CIFAR-10, ImageNet) and 7 DNNs with diverse complexities, under a large set of 6 coverage criteria as feedback. The large-scale experiments demonstrate that DeepHunter can (1) significantly boost the coverage with guidance; (2) generate useful tests to detect erroneous behaviors and facilitate the DNN model quality evaluation; (3) accurately capture potential defects during DNN quantization for platform migration.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes