LG AIFeb 17, 2024

Maintaining Adversarial Robustness in Continuous Learning

Xiaolei Ru, Xiaowei Cao, Zijia Liu, Jack Murdoch Moore, Xin-Ya Zhang, Xia Zhu, Wenjia Wei, Gang Yan

arXiv:2402.11196v210.47 citationsh-index: 10

Originality Incremental advance

AI Analysis

This addresses the vulnerability of machine learning systems to adversarial attacks in continuous learning scenarios, representing an incremental improvement by combining gradient projection with existing defense algorithms.

The paper tackles the problem of adversarial robustness being lost during continual learning by proposing a gradient projection technique that stabilizes sample gradients from previous data, demonstrating superior performance in mitigating robustness degradation on benchmarks like Split-CIFAR100 and Split-miniImageNet.

Adversarial robustness is essential for security and reliability of machine learning systems. However, adversarial robustness enhanced by defense algorithms is easily erased as the neural network's weights update to learn new tasks. To address this vulnerability, it is essential to improve the capability of neural networks in terms of robust continual learning. Specially, we propose a novel gradient projection technique that effectively stabilizes sample gradients from previous data by orthogonally projecting back-propagation gradients onto a crucial subspace before using them for weight updates. This technique can maintaining robustness by collaborating with a class of defense algorithms through sample gradient smoothing. The experimental results on four benchmarks including Split-CIFAR100 and Split-miniImageNet, demonstrate that the superiority of the proposed approach in mitigating rapidly degradation of robustness during continual learning even when facing strong adversarial attacks.

View on arXiv PDF

Similar