CVJun 21, 2024

Effect of Rotation Angle in Self-Supervised Pre-training is Dataset-Dependent

arXiv:2407.05218v1
Originality Synthesis-oriented
AI Analysis

This work addresses the problem of optimizing self-supervised pre-training parameters for researchers in computer vision, but it is incremental as it explores dataset-specific effects without conclusive results.

The study investigated how rotation angles in self-supervised contrastive pre-training affect feature learning across different datasets, finding that the impact is dataset-dependent, with visualizations showing varying saliency map alignments to ground truth segmentations.

Self-supervised learning for pre-training (SSP) can help the network learn better low-level features, especially when the size of the training set is small. In contrastive pre-training, the network is pre-trained to distinguish between different versions of the input. For example, the network learns to distinguish pairs (original, rotated) of images where the rotated image was rotated by angle $θ$ vs. other pairs of images. In this work, we show that, when training using contrastive pre-training in this way, the angle $θ$ and the dataset interact in interesting ways. We hypothesize, and give some evidence, that, for some datasets, the network can take "shortcuts" for particular rotation angles $θ$ based on the distribution of the gradient directions in the input, possibly avoiding learning features other than edges, but our experiments do not seem to support that hypothesis. We demonstrate experiments on three radiology datasets. We compute the saliency map indicating which pixels were important in the SSP process, and compare the saliency map to the ground truth foreground/background segmentation. Our visualizations indicate that the effects of rotation angles in SSP are dataset-dependent. We believe the distribution of gradient orientations may play a role in this, but our experiments so far are inconclusive.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes