LG AI SEMar 25, 2024

DeepKnowledge: Generalisation-Driven Deep Learning Testing

Sondess Missaoui, Simos Gerasimou, Nikolaos Matragkas

arXiv:2403.16768v12.61 citationsh-index: 16

Originality Incremental advance

AI Analysis

This addresses the need for more dependable DNN testing in AI safety, though it is incremental as it builds on existing testing frameworks.

The paper tackles the problem of DNNs being fragile to data distribution shifts by introducing DeepKnowledge, a systematic testing methodology that assesses generalization capability, resulting in up to 10 percentage point improvements over state-of-the-art coverage criteria for detecting adversarial attacks on benchmarks like MNIST, SVHN, and CIFAR.

Despite their unprecedented success, DNNs are notoriously fragile to small shifts in data distribution, demanding effective testing techniques that can assess their dependability. Despite recent advances in DNN testing, there is a lack of systematic testing approaches that assess the DNN's capability to generalise and operate comparably beyond data in their training distribution. We address this gap with DeepKnowledge, a systematic testing methodology for DNN-based systems founded on the theory of knowledge generalisation, which aims to enhance DNN robustness and reduce the residual risk of 'black box' models. Conforming to this theory, DeepKnowledge posits that core computational DNN units, termed Transfer Knowledge neurons, can generalise under domain shift. DeepKnowledge provides an objective confidence measurement on testing activities of DNN given data distribution shifts and uses this information to instrument a generalisation-informed test adequacy criterion to check the transfer knowledge capacity of a test set. Our empirical evaluation of several DNNs, across multiple datasets and state-of-the-art adversarial generation techniques demonstrates the usefulness and effectiveness of DeepKnowledge and its ability to support the engineering of more dependable DNNs. We report improvements of up to 10 percentage points over state-of-the-art coverage criteria for detecting adversarial attacks on several benchmarks, including MNIST, SVHN, and CIFAR.

View on arXiv PDF

Similar