LG CR CV MLMay 17, 2019

POPQORN: Quantifying Robustness of Recurrent Neural Networks

Ching-Yun Ko, Zhaoyang Lyu, Tsui-Wei Weng, Luca Daniel, Ngai Wong, Dahua Lin

arXiv:1905.07387v121.387 citationsHas Code

Originality Highly original

AI Analysis

This work addresses the critical issue of adversarial vulnerability in RNNs, providing a novel method for robustness quantification in a domain where it was previously an open problem.

The authors tackled the problem of quantifying robustness for recurrent neural networks (RNNs), including LSTMs and GRUs, by proposing POPQORN, an algorithm that computes certified lower bounds for adversarial perturbations, demonstrating its effectiveness across different architectures.

The vulnerability to adversarial attacks has been a critical issue for deep neural networks. Addressing this issue requires a reliable way to evaluate the robustness of a network. Recently, several methods have been developed to compute $\textit{robustness quantification}$ for neural networks, namely, certified lower bounds of the minimum adversarial perturbation. Such methods, however, were devised for feed-forward networks, e.g. multi-layer perceptron or convolutional networks. It remains an open problem to quantify robustness for recurrent networks, especially LSTM and GRU. For such networks, there exist additional challenges in computing the robustness quantification, such as handling the inputs at multiple steps and the interaction between gates and states. In this work, we propose $\textit{POPQORN}$ ($\textbf{P}$ropagated-$\textbf{o}$ut$\textbf{p}$ut $\textbf{Q}$uantified R$\textbf{o}$bustness for $\textbf{RN}$Ns), a general algorithm to quantify robustness of RNNs, including vanilla RNNs, LSTMs, and GRUs. We demonstrate its effectiveness on different network architectures and show that the robustness quantification on individual steps can lead to new insights.

View on arXiv PDF Code

Similar