LGCVMar 19, 2025

On the Robustness Tradeoff in Fine-Tuning

arXiv:2503.14836v26 citationsh-index: 9
Originality Incremental advance
AI Analysis

This work addresses the robustness-accuracy trade-off in fine-tuning for reliable real-world deployments, providing incremental insights into specific strategies.

The paper investigates the trade-off between adversarial robustness and accuracy in fine-tuning pre-trained models, finding that peripheral updates like BitFit improve robustness on simple tasks by over 75% above average, while fine-tuning attention layers with Compacter enhances it on complex tasks by up to 57.5% above average.

Fine-tuning has become the standard practice for adapting pre-trained models to downstream tasks. However, the impact on model robustness is not well understood. In this work, we characterize the robustness-accuracy trade-off in fine-tuning. We evaluate the robustness and accuracy of fine-tuned models over 6 benchmark datasets and 7 different fine-tuning strategies. We observe a consistent trade-off between adversarial robustness and accuracy. Peripheral updates such as BitFit are more effective for simple tasks -- over 75% above the average measured by the area under the Pareto frontiers on CIFAR-10 and CIFAR-100. In contrast, fine-tuning information-heavy layers, such as attention layers via Compacter, achieves a better Pareto frontier on more complex tasks -- 57.5% and 34.6% above the average on Caltech-256 and CUB-200, respectively. Lastly, we observe that the robustness of fine-tuning against out-of-distribution data closely tracks accuracy. These insights emphasize the need for robustness-aware fine-tuning to ensure reliable real-world deployments.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes