CVJan 27, 2025

SketchYourSeg: Mask-Free Subjective Image Segmentation via Freehand Sketches

arXiv:2501.16022v21 citationsh-index: 33
Originality Highly original
AI Analysis

This addresses the challenge of precise user-guided segmentation for applications like image editing or retrieval, offering a novel approach that combines semantic and structural intent.

The paper tackles the problem of subjective image segmentation by introducing SketchYourSeg, a framework that uses freehand sketches as queries to segment images across galleries without needing pixel-perfect masks during training, achieving superior performance over existing methods in benchmarks.

We introduce SketchYourSeg, a novel framework that establishes freehand sketches as a powerful query modality for subjective image segmentation across entire galleries through a single exemplar sketch. Unlike text prompts that struggle with spatial specificity or interactive methods confined to single-image operations, sketches naturally combine semantic intent with structural precision. This unique dual encoding enables precise visual disambiguation for segmentation tasks where text descriptions would be cumbersome or ambiguous -- such as distinguishing between visually similar instances, specifying exact part boundaries, or indicating spatial relationships in composed concepts. Our approach addresses three fundamental challenges: (i) eliminating the need for pixel-perfect annotation masks during training with a mask-free framework; (ii) creating a synergistic relationship between sketch-based image retrieval (SBIR) models and foundation models (CLIP/DINOv2) where the former provides training signals while the latter generates masks; and (iii) enabling multi-granular segmentation capabilities through purpose-made sketch augmentation strategies. Our extensive evaluations demonstrate superior performance over existing approaches across diverse benchmarks, establishing a new paradigm for user-guided image segmentation that balances precision with efficiency.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes