AICVDec 10, 2023

Singular Value Penalization and Semantic Data Augmentation for Fully Test-Time Adaptation

arXiv:2312.08378v12 citations
Originality Incremental advance
AI Analysis

This work addresses domain adaptation for models in scenarios where source data is unavailable during testing, offering incremental improvements over existing FTTA methods.

The paper tackles the problem of fully test-time adaptation (FTTA) by proposing a method that maximizes the sum of singular values while minimizing their variance to enhance discriminability and diversity in predictions, and incorporates semantic data augmentation to reduce overfitting, achieving improved performance over state-of-the-art methods on benchmark datasets.

Fully test-time adaptation (FTTA) adapts a model that is trained on a source domain to a target domain during the testing phase, where the two domains follow different distributions and source data is unavailable during the training phase. Existing methods usually adopt entropy minimization to reduce the uncertainty of target prediction results, and improve the FTTA performance accordingly. However, they fail to ensure the diversity in target prediction results. Recent domain adaptation study has shown that maximizing the sum of singular values of prediction results can simultaneously enhance their confidence (discriminability) and diversity. However, during the training phase, larger singular values usually take up a dominant position in loss maximization. This results in the model being more inclined to enhance discriminability for easily distinguishable classes, and the improvement in diversity is insufficiently effective. Furthermore, the adaptation and prediction in FTTA only use data from the current batch, which may lead to the risk of overfitting. To address the aforementioned issues, we propose maximizing the sum of singular values while minimizing their variance. This enables the model's focus toward the smaller singular values, enhancing discriminability between more challenging classes and effectively increasing the diversity of prediction results. Moreover, we incorporate data from the previous batch to realize semantic data augmentation for the current batch, reducing the risk of overfitting. Extensive experiments on benchmark datasets show our proposed approach outperforms some compared state-of-the-art FTTA methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes