Intelligent Multi-View Test Time Augmentation
This work addresses robustness in image classification for applications sensitive to viewpoint changes, but it is incremental as it builds on existing TTA methods with a novel selection mechanism.
The paper tackles the problem of improving image classification robustness against viewpoint variations by introducing an intelligent Test Time Augmentation (TTA) algorithm that selects optimal augmentations based on predictive uncertainty, resulting in an average accuracy improvement of 1.73% over single-view methods.
In this study, we introduce an intelligent Test Time Augmentation (TTA) algorithm designed to enhance the robustness and accuracy of image classification models against viewpoint variations. Unlike traditional TTA methods that indiscriminately apply augmentations, our approach intelligently selects optimal augmentations based on predictive uncertainty metrics. This selection is achieved via a two-stage process: the first stage identifies the optimal augmentation for each class by evaluating uncertainty levels, while the second stage implements an uncertainty threshold to determine when applying TTA would be advantageous. This methodological advancement ensures that augmentations contribute to classification more effectively than a uniform application across the dataset. Experimental validation across several datasets and neural network architectures validates our approach, yielding an average accuracy improvement of 1.73% over methods that use single-view images. This research underscores the potential of adaptive, uncertainty-aware TTA in improving the robustness of image classification in the presence of viewpoint variations, paving the way for further exploration into intelligent augmentation strategies.