CVDec 29, 2022

AttEntropy: On the Generalization Ability of Supervised Semantic Segmentation Transformers to New Objects in New Domains

arXiv:2212.14397v34 citationsh-index: 15
Originality Incremental advance
AI Analysis

This work addresses the problem of limited generalization in semantic segmentation for researchers and practitioners, showing incremental improvement by extending transformer capabilities to new objects and domains without retraining.

The paper demonstrates that supervised semantic segmentation transformers, once trained on a set of categories, can generalize to segment objects from unseen classes across diverse domains such as road obstacles, aircraft, lunar rocks, and maritime hazards, leveraging information not explicitly trained for.

In addition to impressive performance, vision transformers have demonstrated remarkable abilities to encode information they were not trained to extract. For example, this information can be used to perform segmentation or single-view depth estimation even though the networks were only trained for image recognition. We show that a similar phenomenon occurs when explicitly training transformers for semantic segmentation in a supervised manner for a set of categories: Once trained, they provide valuable information even about categories absent from the training set. This information can be used to segment objects from these never-seen-before classes in domains as varied as road obstacles, aircraft parked at a terminal, lunar rocks, and maritime hazards.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes