CRJan 28, 2019

Defense Methods Against Adversarial Examples for Recurrent Neural Networks

arXiv:1901.09963v544 citations
Originality Incremental advance
AI Analysis

This addresses the vulnerability of RNNs to adversarial attacks, particularly in cybersecurity and NLP domains, representing an incremental advancement by adapting CNN defense methods to sequence-based models.

The paper tackles the problem of adversarial examples for recurrent neural networks (RNNs) by proposing a novel defense method called sequence squeezing, along with four additional methods, which reduced attack effectiveness from 99.9% to 15% in cybersecurity evaluations.

Adversarial examples are known to mislead deep learning models to incorrectly classify them, even in domains where such models achieve state-of-the-art performance. Until recently, research on both attack and defense methods focused on image recognition, primarily using convolutional neural networks (CNNs). In recent years, adversarial example generation methods for recurrent neural networks (RNNs) have been published, demonstrating that RNN classifiers are also vulnerable to such attacks. In this paper, we present a novel defense method, termed sequence squeezing, to make RNN classifiers more robust against such attacks. Our method differs from previous defense methods which were designed only for non-sequence based models. We also implement four additional RNN defense methods inspired by recently published CNN defense methods. We evaluate our methods against state-of-the-art attacks in the cyber security domain where real adversaries (malware developers) exist, but our methods can be applied against other discrete sequence based adversarial attacks, e.g., in the NLP domain. Using our methods we were able to decrease the effectiveness of such attack from 99.9% to 15%.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes