LGCRAug 23, 2023

Adversarial Training Using Feedback Loops

arXiv:2308.11881v21 citationsh-index: 46
Originality Incremental advance
AI Analysis

This addresses the need for robust DNNs against adversarial perturbations, which is crucial for reliable AI applications, but appears incremental as it builds on existing adversarial training techniques with a control theory twist.

The paper tackles the problem of deep neural networks being susceptible to adversarial attacks by proposing a new robustification approach based on control theory, called Feedback Looped Adversarial Training (FLAT), which is shown to be more effective than state-of-the-art methods in guarding against such attacks.

Deep neural networks (DNN) have found wide applicability in numerous fields due to their ability to accurately learn very complex input-output relations. Despite their accuracy and extensive use, DNNs are highly susceptible to adversarial attacks due to limited generalizability. For future progress in the field, it is essential to build DNNs that are robust to any kind of perturbations to the data points. In the past, many techniques have been proposed to robustify DNNs using first-order derivative information of the network. This paper proposes a new robustification approach based on control theory. A neural network architecture that incorporates feedback control, named Feedback Neural Networks, is proposed. The controller is itself a neural network, which is trained using regular and adversarial data such as to stabilize the system outputs. The novel adversarial training approach based on the feedback control architecture is called Feedback Looped Adversarial Training (FLAT). Numerical results on standard test problems empirically show that our FLAT method is more effective than the state-of-the-art to guard against adversarial attacks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes