CVLGDec 1, 2022

Hyperbolic Contrastive Learning for Visual Representations beyond Objects

arXiv:2212.00653v184 citationsh-index: 35Has Code
Originality Incremental advance
AI Analysis

This work addresses the need for better scene-object representations in computer vision, offering incremental improvements over existing methods.

The paper tackles the problem of learning visual representations that preserve hierarchical structure between scenes and objects by proposing a contrastive learning framework with a hyperbolic loss for scenes, improving downstream performance on tasks like image classification and object detection across multiple datasets.

Although self-/un-supervised methods have led to rapid progress in visual representation learning, these methods generally treat objects and scenes using the same lens. In this paper, we focus on learning representations for objects and scenes that preserve the structure among them. Motivated by the observation that visually similar objects are close in the representation space, we argue that the scenes and objects should instead follow a hierarchical structure based on their compositionality. To exploit such a structure, we propose a contrastive learning framework where a Euclidean loss is used to learn object representations and a hyperbolic loss is used to encourage representations of scenes to lie close to representations of their constituent objects in a hyperbolic space. This novel hyperbolic objective encourages the scene-object hypernymy among the representations by optimizing the magnitude of their norms. We show that when pretraining on the COCO and OpenImages datasets, the hyperbolic loss improves downstream performance of several baselines across multiple datasets and tasks, including image classification, object detection, and semantic segmentation. We also show that the properties of the learned representations allow us to solve various vision tasks that involve the interaction between scenes and objects in a zero-shot fashion. Our code can be found at \url{https://github.com/shlokk/HCL/tree/main/HCL}.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes