CVAug 20, 2025

Towards PerSense++: Advancing Training-Free Personalized Instance Segmentation in Dense Images

arXiv:2508.14660v12 citationsh-index: 2
Originality Incremental advance
AI Analysis

This addresses the problem of accurate instance segmentation in cluttered, occluded images for computer vision applications, representing a domain-specific incremental improvement.

The paper tackles instance segmentation in dense visual scenes by introducing PerSense++, a training-free framework that generates instance-level candidate point prompts using density maps and filters false positives with adaptive thresholding and spatial gating. It achieves state-of-the-art performance on multiple benchmarks for this task.

Segmentation in dense visual scenes poses significant challenges due to occlusions, background clutter, and scale variations. To address this, we introduce PerSense, an end-to-end, training-free, and model-agnostic one-shot framework for Personalized instance Segmentation in dense images. PerSense employs a novel Instance Detection Module (IDM) that leverages density maps (DMs) to generate instance-level candidate point prompts, followed by a Point Prompt Selection Module (PPSM) that filters false positives via adaptive thresholding and spatial gating. A feedback mechanism further enhances segmentation by automatically selecting effective exemplars to improve DM quality. We additionally present PerSense++, an enhanced variant that incorporates three additional components to improve robustness in cluttered scenes: (i) a diversity-aware exemplar selection strategy that leverages feature and scale diversity for better DM generation; (ii) a hybrid IDM combining contour and peak-based prompt generation for improved instance separation within complex density patterns; and (iii) an Irrelevant Mask Rejection Module (IMRM) that discards spatially inconsistent masks using outlier analysis. Finally, to support this underexplored task, we introduce PerSense-D, a dedicated benchmark for personalized segmentation in dense images. Extensive experiments across multiple benchmarks demonstrate that PerSense++ outperforms existing methods in dense settings.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes