LGJun 12, 2021

CARTL: Cooperative Adversarially-Robust Transfer Learning

arXiv:2106.06667v115 citations
Originality Incremental advance
AI Analysis

This addresses a critical issue for practitioners using transfer learning in security-sensitive applications, though it is an incremental improvement focused on robustness transfer.

The paper tackles the problem of adversarial robustness degradation during transfer learning, showing that standard fine-tuning reduces inherited robustness. The proposed CARTL method improves inherited robustness by up to 28% while maintaining accuracy.

Transfer learning eases the burden of training a well-performed model from scratch, especially when training data is scarce and computation power is limited. In deep learning, a typical strategy for transfer learning is to freeze the early layers of a pre-trained model and fine-tune the rest of its layers on the target domain. Previous work focuses on the accuracy of the transferred model but neglects the transfer of adversarial robustness. In this work, we first show that transfer learning improves the accuracy on the target domain but degrades the inherited robustness of the target model. To address such a problem, we propose a novel cooperative adversarially-robust transfer learning (CARTL) by pre-training the model via feature distance minimization and fine-tuning the pre-trained model with non-expansive fine-tuning for target domain tasks. Empirical results show that CARTL improves the inherited robustness by about 28% at most compared with the baseline with the same degree of accuracy. Furthermore, we study the relationship between the batch normalization (BN) layers and the robustness in the context of transfer learning, and we reveal that freezing BN layers can further boost the robustness transfer.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes