LG SEApr 4, 2023

CGDTest: A Constrained Gradient Descent Algorithm for Testing Neural Networks

Vineel Nagisetty, Laura Graves, Guanting Pan, Piyush Jha, Vijay Ganesh

arXiv:2304.01826v12.01 citationsh-index: 8

Originality Highly original

AI Analysis

This provides a more flexible and powerful tool for developers and researchers to test neural networks for a wide range of adversarial robustness issues, though it is incremental in extending gradient descent methods with constraint specifications.

The paper tackles the problem of testing neural networks for security and robustness issues like adversarial attacks and bias by proposing CGDTest, a constrained gradient descent algorithm that allows users to specify logical constraints. The results show that CGDTest outperforms 32 state-of-the-art methods, with improvements in PAR2 scores of over 1500% in some cases.

In this paper, we propose a new Deep Neural Network (DNN) testing algorithm called the Constrained Gradient Descent (CGD) method, and an implementation we call CGDTest aimed at exposing security and robustness issues such as adversarial robustness and bias in DNNs. Our CGD algorithm is a gradient-descent (GD) method, with the twist that the user can also specify logical properties that characterize the kinds of inputs that the user may want. This functionality sets CGDTest apart from other similar DNN testing tools since it allows users to specify logical constraints to test DNNs not only for $\ell_p$ ball-based adversarial robustness but, more importantly, includes richer properties such as disguised and flow adversarial constraints, as well as adversarial robustness in the NLP domain. We showcase the utility and power of CGDTest via extensive experimentation in the context of vision and NLP domains, comparing against 32 state-of-the-art methods over these diverse domains. Our results indicate that CGDTest outperforms state-of-the-art testing tools for $\ell_p$ ball-based adversarial robustness, and is significantly superior in testing for other adversarial robustness, with improvements in PAR2 scores of over 1500% in some cases over the next best tool. Our evaluation shows that our CGD method outperforms competing methods we compared against in terms of expressibility (i.e., a rich constraint language and concomitant tool support to express a wide variety of properties), scalability (i.e., can be applied to very large real-world models with up to 138 million parameters), and generality (i.e., can be used to test a plethora of model architectures).

View on arXiv PDF

Similar