MLAILGSep 19, 2024

Test-Time Augmentation Meets Variational Bayes

arXiv:2409.12587v15 citationsh-index: 2
Originality Incremental advance
AI Analysis

This work addresses a specific issue in robust machine learning for practitioners using TTA, but it is incremental as it builds on existing TTA techniques with a novel weighting method.

The paper tackles the problem of varying contributions from different data augmentation methods in Test-Time Augmentation (TTA), which can negatively impact prediction performance, by proposing a weighted TTA approach formalized in a variational Bayesian framework to optimize weights and suppress unwanted augmentations.

Data augmentation is known to contribute significantly to the robustness of machine learning models. In most instances, data augmentation is utilized during the training phase. Test-Time Augmentation (TTA) is a technique that instead leverages these data augmentations during the testing phase to achieve robust predictions. More precisely, TTA averages the predictions of multiple data augmentations of an instance to produce a final prediction. Although the effectiveness of TTA has been empirically reported, it can be expected that the predictive performance achieved will depend on the set of data augmentation methods used during testing. In particular, the data augmentation methods applied should make different contributions to performance. That is, it is anticipated that there may be differing degrees of contribution in the set of data augmentation methods used for TTA, and these could have a negative impact on prediction performance. In this study, we consider a weighted version of the TTA based on the contribution of each data augmentation. Some variants of TTA can be regarded as considering the problem of determining the appropriate weighting. We demonstrate that the determination of the coefficients of this weighted TTA can be formalized in a variational Bayesian framework. We also show that optimizing the weights to maximize the marginal log-likelihood suppresses candidates of unwanted data augmentations at the test phase.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes