CVJun 27, 2023

What a MESS: Multi-Domain Evaluation of Zero-Shot Semantic Segmentation

arXiv:2306.15521v330 citationsh-index: 12Has Code
Originality Synthesis-oriented
AI Analysis

This work provides a standardized benchmark for researchers to assess zero-shot semantic segmentation models across diverse domains, though it is incremental as it builds on existing methods without introducing new techniques.

The authors tackled the problem of limited generalization in semantic segmentation by creating a benchmark for multi-domain evaluation of zero-shot semantic segmentation, resulting in the MESS benchmark with 22 datasets across domains like medicine and agriculture, and evaluating eight models to analyze performance characteristics.

While semantic segmentation has seen tremendous improvements in the past, there are still significant labeling efforts necessary and the problem of limited generalization to classes that have not been present during training. To address this problem, zero-shot semantic segmentation makes use of large self-supervised vision-language models, allowing zero-shot transfer to unseen classes. In this work, we build a benchmark for Multi-domain Evaluation of Semantic Segmentation (MESS), which allows a holistic analysis of performance across a wide range of domain-specific datasets such as medicine, engineering, earth monitoring, biology, and agriculture. To do this, we reviewed 120 datasets, developed a taxonomy, and classified the datasets according to the developed taxonomy. We select a representative subset consisting of 22 datasets and propose it as the MESS benchmark. We evaluate eight recently published models on the proposed MESS benchmark and analyze characteristics for the performance of zero-shot transfer models. The toolkit is available at https://github.com/blumenstiel/MESS.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes