CVJul 15, 2023

Multiscale Memory Comparator Transformer for Few-Shot Video Segmentation

arXiv:2307.07812v13 citationsh-index: 37Has Code
Originality Incremental advance
AI Analysis

This work addresses the problem of segmenting novel objects in videos with limited labeled data, representing an incremental improvement in method design for this domain-specific task.

The paper tackles few-shot video segmentation by introducing a Multiscale Memory Comparator (MMC) that preserves detailed feature maps across scales in a transformer decoder, outperforming baselines and achieving state-of-the-art results.

Few-shot video segmentation is the task of delineating a specific novel class in a query video using few labelled support images. Typical approaches compare support and query features while limiting comparisons to a single feature layer and thereby ignore potentially valuable information. We present a meta-learned Multiscale Memory Comparator (MMC) for few-shot video segmentation that combines information across scales within a transformer decoder. Typical multiscale transformer decoders for segmentation tasks learn a compressed representation, their queries, through information exchange across scales. Unlike previous work, we instead preserve the detailed feature maps during across scale information exchange via a multiscale memory transformer decoding to reduce confusion between the background and novel class. Integral to the approach, we investigate multiple forms of information exchange across scales in different tasks and provide insights with empirical evidence on which to use in each task. The overall comparisons among query and support features benefit from both rich semantics and precise localization. We demonstrate our approach primarily on few-shot video object segmentation and an adapted version on the fully supervised counterpart. In all cases, our approach outperforms the baseline and yields state-of-the-art performance. Our code is publicly available at https://github.com/MSiam/MMC-MultiscaleMemory.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes