CVFeb 6

Taming SAM3 in the Wild: A Concept Bank for Open-Vocabulary Segmentation

arXiv:2602.06333v12 citationsh-index: 9Has Code
Originality Incremental advance
AI Analysis

This work addresses robustness issues in open-vocabulary segmentation for applications like natural-scene and remote-sensing analysis, representing an incremental improvement.

The paper tackles the vulnerability of SAM3 in Open-Vocabulary Segmentation to data and concept drift by introducing ConceptBank, a parameter-free calibration framework that adapts the model to distribution shifts, establishing a new baseline for robustness and efficiency in OVS.

The recent introduction of \texttt{SAM3} has revolutionized Open-Vocabulary Segmentation (OVS) through \textit{promptable concept segmentation}, which grounds pixel predictions in flexible concept prompts. However, this reliance on pre-defined concepts makes the model vulnerable: when visual distributions shift (\textit{data drift}) or conditional label distributions evolve (\textit{concept drift}) in the target domain, the alignment between visual evidence and prompts breaks down. In this work, we present \textsc{ConceptBank}, a parameter-free calibration framework to restore this alignment on the fly. Instead of adhering to static prompts, we construct a dataset-specific concept bank from the target statistics. Our approach (\textit{i}) anchors target-domain evidence via class-wise visual prototypes, (\textit{ii}) mines representative supports to suppress outliers under data drift, and (\textit{iii}) fuses candidate concepts to rectify concept drift. We demonstrate that \textsc{ConceptBank} effectively adapts \texttt{SAM3} to distribution drifts, including challenging natural-scene and remote-sensing scenarios, establishing a new baseline for robustness and efficiency in OVS. Code and model are available at https://github.com/pgsmall/ConceptBank.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes