CVLGJun 26, 2022

Self-Healing Robust Neural Networks via Closed-Loop Control

arXiv:2206.12963v114 citationsh-index: 46Has Code
Originality Incremental advance
AI Analysis

This addresses robustness issues in neural networks for AI safety applications, offering a novel approach but with incremental improvements in defense mechanisms.

The paper tackles the vulnerability of neural networks to attacks by proposing a self-healing mechanism inspired by the immune system, which automatically detects and fixes errors without prior knowledge of perturbations, and validates it across architectures with broad applicability.

Despite the wide applications of neural networks, there have been increasing concerns about their vulnerability issue. While numerous attack and defense techniques have been developed, this work investigates the robustness issue from a new angle: can we design a self-healing neural network that can automatically detect and fix the vulnerability issue by itself? A typical self-healing mechanism is the immune system of a human body. This biology-inspired idea has been used in many engineering designs but is rarely investigated in deep learning. This paper considers the post-training self-healing of a neural network, and proposes a closed-loop control formulation to automatically detect and fix the errors caused by various attacks or perturbations. We provide a margin-based analysis to explain how this formulation can improve the robustness of a classifier. To speed up the inference of the proposed self-healing network, we solve the control problem via improving the Pontryagin Maximum Principle-based solver. Lastly, we present an error estimation of the proposed framework for neural networks with nonlinear activation functions. We validate the performance on several network architectures against various perturbations. Since the self-healing method does not need a-priori information about data perturbations/attacks, it can handle a broad class of unforeseen perturbations.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes