CLMay 2, 2022

Robust Fine-tuning via Perturbation and Interpolation from In-batch Instances

Peking U
arXiv:2205.00633v17 citationsh-index: 38
Originality Incremental advance
AI Analysis

This addresses robustness issues in fine-tuning for NLP tasks, particularly in safe-critical scenarios, but is incremental as it builds on existing fine-tuning practices.

The paper tackles the vulnerability of pretrained language models to adversarial attacks and imbalanced data by proposing Match-Tuning, a fine-tuning method that uses in-batch instances for perturbation and interpolation, resulting in a 1.64 score improvement on GLUE benchmarks and enhanced robustness.

Fine-tuning pretrained language models (PLMs) on downstream tasks has become common practice in natural language processing. However, most of the PLMs are vulnerable, e.g., they are brittle under adversarial attacks or imbalanced data, which hinders the application of the PLMs on some downstream tasks, especially in safe-critical scenarios. In this paper, we propose a simple yet effective fine-tuning method called Match-Tuning to force the PLMs to be more robust. For each instance in a batch, we involve other instances in the same batch to interact with it. To be specific, regarding the instances with other labels as a perturbation, Match-Tuning makes the model more robust to noise at the beginning of training. While nearing the end, Match-Tuning focuses more on performing an interpolation among the instances with the same label for better generalization. Extensive experiments on various tasks in GLUE benchmark show that Match-Tuning consistently outperforms the vanilla fine-tuning by $1.64$ scores. Moreover, Match-Tuning exhibits remarkable robustness to adversarial attacks and data imbalance.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes