CVJun 18, 2024

MAC: A Benchmark for Multiple Attributes Compositional Zero-Shot Learning

arXiv:2406.12757v42 citations
Originality Incremental advance
AI Analysis

This work addresses the limitation of existing CZSL datasets for researchers by providing a more realistic benchmark, though it is incremental as it builds on prior CZSL frameworks.

The paper tackles the problem of compositional zero-shot learning (CZSL) by introducing a new dataset, MAC, which includes multiple interrelated attributes per object, addressing biases in existing single-attribute datasets, and results in a baseline method, MVP-Integrator, that significantly outperforms prior methods on this dataset with improved efficiency.

Compositional Zero-Shot Learning (CZSL) aims to learn semantic primitives (attributes and objects) from seen compositions and recognize unseen attribute-object compositions. Existing CZSL datasets focus on single attributes, neglecting the fact that objects naturally exhibit multiple interrelated attributes. Their narrow attribute scope and single attribute labeling introduce annotation biases, misleading the learning of attributes and causing inaccurate evaluation. To address these issues, we introduce the Multi-Attribute Composition (MAC) dataset, encompassing 22,838 images and 17,627 compositions with comprehensive and representative attribute annotations. MAC shows complex relationship between attributes and objects, with each attribute type linked to an average of 82.2 object types, and each object type associated with 31.4 attribute types. Based on MAC, we propose multi-attribute compositional zero-shot learning that requires deeper semantic understanding and advanced attribute associations, establishing a more realistic and challenging benchmark for CZSL. We also propose Multi-attribute Visual-Primitive Integrator (MVP-Integrator), a robust baseline for multi-attribute CZSL, which disentangles semantic primitives and performs effective visual-primitive association. Experimental results demonstrate that MVP-Integrator significantly outperforms existing CZSL methods on MAC with improved inference efficiency.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes