CV · Jan 9, 2024

MST: Adaptive Multi-Scale Tokens Guided Interactive Segmentation

arXiv:2401.04403v2 · 2 citations · h-index: 9 · Has Code · IEEE Transactions on Circuits and Systems for Video Technology (Print)
AI Analysis

This work addresses scale variation issues in interactive segmentation for applications like human-computer interaction and data annotation, representing an incremental improvement.

The paper tackles target scale variation in interactive segmentation by proposing a multi-scale token adaptation algorithm with contrastive loss-based token learning, and reports state-of-the-art performance on benchmarks.

Interactive segmentation has gained significant attention for its applications in human-computer interaction and data annotation. To address the target scale variation issue in interactive segmentation, we propose a novel multi-scale token adaptation algorithm. By performing top-k operations across multi-scale tokens, the computational complexity is greatly reduced while performance is preserved. To make multi-scale token selection more robust, we also propose a token learning algorithm based on contrastive loss, which effectively improves the performance of multi-scale token adaptation. Extensive benchmarking shows that the algorithm achieves state-of-the-art (SOTA) performance compared to current methods. An interactive demo and all code needed to reproduce the results will be released at https://github.com/hahamyt/mst.
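
As a rough illustration of the two ideas named in the abstract, the sketch below pools tokens from several scales, keeps only the k highest-scoring ones, and computes a generic contrastive loss. It is not the authors' implementation: the dot-product scoring against a query vector, the function names `select_top_k` and `info_nce`, the temperature value, and the InfoNCE form of the loss are all assumptions made for illustration, since the abstract does not specify the scoring rule or the exact loss.

```python
import heapq
import math


def dot(u, v):
    """Plain dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def select_top_k(tokens_by_scale, query, k):
    """Pool tokens from every scale and keep the k best-scoring ones.

    Assumption: a token's score is its dot product with `query`
    (e.g. a click-derived embedding); the paper may score differently.
    Returns (scale_name, token_index, score) tuples, highest score first.
    """
    scored = [
        (scale, i, dot(token, query))
        for scale, tokens in tokens_by_scale.items()
        for i, token in enumerate(tokens)
    ]
    return heapq.nlargest(k, scored, key=lambda item: item[2])


def info_nce(anchor, positive, negatives, temperature=0.1):
    """Generic InfoNCE-style contrastive loss (an assumed stand-in).

    Low when `anchor` is closer to `positive` than to any negative.
    """
    logits = [dot(anchor, positive) / temperature]
    logits += [dot(anchor, neg) / temperature for neg in negatives]
    peak = max(logits)  # subtract the max for numerical stability
    log_sum = peak + math.log(sum(math.exp(x - peak) for x in logits))
    return log_sum - logits[0]


if __name__ == "__main__":
    # Hypothetical toy tokens at two scales, 2-D for readability.
    tokens = {
        "coarse": [[1.0, 0.0], [0.0, 1.0]],
        "fine": [[0.9, 0.1], [0.2, 0.8], [0.7, 0.7]],
    }
    print(select_top_k(tokens, query=[1.0, 0.0], k=2))
    print(info_nce([1.0, 0.0], [1.0, 0.0], [[0.0, 1.0]]))
```

Selecting k tokens from the pooled set, rather than attending over every token at every scale, is what keeps the downstream cost bounded by k in this sketch; how the paper realizes the saving may differ.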

Code Implementations: 1 repo