CVNov 14, 2022

Information-guided pixel augmentation for pixel-wise contrastive learning

arXiv:2211.07118v15 citationsh-index: 14
Originality Incremental advance
AI Analysis

This work addresses a gap in pixel-wise contrastive learning for tasks like medical landmark detection, though it is incremental as it adapts instance-level augmentation principles to the pixel level.

The paper tackles the lack of pixel augmentation strategies for pixel-wise contrastive learning by proposing an information-guided method that classifies pixels into low-, medium-, and high-informative categories and designs separate augmentation strategies for each, resulting in improved unsupervised local feature matching and enhanced performance for one-shot and fully supervised models.

Contrastive learning (CL) is a form of self-supervised learning and has been widely used for various tasks. Different from widely studied instance-level contrastive learning, pixel-wise contrastive learning mainly helps with pixel-wise tasks such as medical landmark detection. The counterpart to an instance in instance-level CL is a pixel, along with its neighboring context, in pixel-wise CL. Aiming to build better feature representation, there is a vast literature about designing instance augmentation strategies for instance-level CL; but there is little similar work on pixel augmentation for pixel-wise CL with a pixel granularity. In this paper, we attempt to bridge this gap. We first classify a pixel into three categories, namely low-, medium-, and high-informative, based on the information quantity the pixel contains. Inspired by the ``InfoMin" principle, we then design separate augmentation strategies for each category in terms of augmentation intensity and sampling ratio. Extensive experiments validate that our information-guided pixel augmentation strategy succeeds in encoding more discriminative representations and surpassing other competitive approaches in unsupervised local feature matching. Furthermore, our pretrained model improves the performance of both one-shot and fully supervised models. To the best of our knowledge, we are the first to propose a pixel augmentation method with a pixel granularity for enhancing unsupervised pixel-wise contrastive learning.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes