CVMar 25, 2024

Task2Box: Box Embeddings for Modeling Asymmetric Task Relationships

arXiv:2403.17173v22 citationsh-index: 7CVPR
Originality Highly original
AI Analysis

This addresses the need for better representation and visualization of asymmetric task relationships in meta-tasks like dataset discovery and transfer learning, offering an interpretable solution.

The paper tackled the problem of modeling asymmetric relationships between tasks, such as containment and transferability, by proposing Task2Box, which uses box embeddings to capture these relationships through volumetric overlaps. The result showed that Task2Box accurately predicts unseen hierarchical relationships in ImageNet and iNaturalist datasets and transferability in the Taskonomy benchmark, outperforming classifiers and handcrafted asymmetric distances.

Modeling and visualizing relationships between tasks or datasets is an important step towards solving various meta-tasks such as dataset discovery, multi-tasking, and transfer learning. However, many relationships, such as containment and transferability, are naturally asymmetric and current approaches for representation and visualization (e.g., t-SNE) do not readily support this. We propose Task2Box, an approach to represent tasks using box embeddings -- axis-aligned hyperrectangles in low dimensional spaces -- that can capture asymmetric relationships between them through volumetric overlaps. We show that Task2Box accurately predicts unseen hierarchical relationships between nodes in ImageNet and iNaturalist datasets, as well as transferability between tasks in the Taskonomy benchmark. We also show that box embeddings estimated from task representations (e.g., CLIP, Task2Vec, or attribute based) can be used to predict relationships between unseen tasks more accurately than classifiers trained on the same representations, as well as handcrafted asymmetric distances (e.g., KL divergence). This suggests that low-dimensional box embeddings can effectively capture these task relationships and have the added advantage of being interpretable. We use the approach to visualize relationships among publicly available image classification datasets on popular dataset hosting platform called Hugging Face.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes