LGAIFeb 17, 2024

Maintaining Adversarial Robustness in Continuous Learning

arXiv:2402.11196v27 citationsh-index: 10
AI Analysis

This addresses the vulnerability of machine learning systems to adversarial attacks in continuous learning scenarios, representing an incremental improvement by combining gradient projection with existing defense algorithms.

The paper tackles the problem of adversarial robustness being lost during continual learning by proposing a gradient projection technique that stabilizes sample gradients from previous data, demonstrating superior performance in mitigating robustness degradation on benchmarks like Split-CIFAR100 and Split-miniImageNet.

Adversarial robustness is essential for security and reliability of machine learning systems. However, adversarial robustness enhanced by defense algorithms is easily erased as the neural network's weights update to learn new tasks. To address this vulnerability, it is essential to improve the capability of neural networks in terms of robust continual learning. Specially, we propose a novel gradient projection technique that effectively stabilizes sample gradients from previous data by orthogonally projecting back-propagation gradients onto a crucial subspace before using them for weight updates. This technique can maintaining robustness by collaborating with a class of defense algorithms through sample gradient smoothing. The experimental results on four benchmarks including Split-CIFAR100 and Split-miniImageNet, demonstrate that the superiority of the proposed approach in mitigating rapidly degradation of robustness during continual learning even when facing strong adversarial attacks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes