CLApr 25, 2023

Test-Time Adaptation with Perturbation Consistency Learning

arXiv:2304.12764v11 citationsh-index: 44
Originality Incremental advance
AI Analysis

This addresses the issue of models failing in real test scenarios due to distribution shifts, offering a more efficient test-time adaptation method for language models.

The paper tackles the problem of distribution shifts in pre-trained language models by proposing perturbation consistency learning (PCL) for test-time adaptation, achieving higher or comparable performance with less inference time in experiments on adversarial robustness and cross-lingual transferring.

Currently, pre-trained language models (PLMs) do not cope well with the distribution shift problem, resulting in models trained on the training set failing in real test scenarios. To address this problem, the test-time adaptation (TTA) shows great potential, which updates model parameters to suit the test data at the testing time. Existing TTA methods rely on well-designed auxiliary tasks or self-training strategies based on pseudo-label. However, these methods do not achieve good trade-offs regarding performance gains and computational costs. To obtain some insights into such a dilemma, we take two representative TTA methods, i.e., Tent and OIL, for exploration and find that stable prediction is the key to achieving a good balance. Accordingly, in this paper, we propose perturbation consistency learning (PCL), a simple test-time adaptation method to promote the model to make stable predictions for samples with distribution shifts. Extensive experiments on adversarial robustness and cross-lingual transferring demonstrate that our method can achieve higher or comparable performance with less inference time over strong PLM backbones and previous state-of-the-art TTA methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes