CVAIHCLGIVOct 11, 2025

Explainable Human-in-the-Loop Segmentation via Critic Feedback Signals

arXiv:2510.09945v1h-index: 5
Originality Incremental advance
AI Analysis

This work addresses the need for accurate, robust, and data-efficient segmentation systems for researchers and practitioners in domains like urban climate monitoring and autonomous driving, offering a practical framework with incremental improvements over existing methods.

The paper tackles the problem of segmentation models relying on spurious correlations in real-world domains by proposing a human-in-the-loop interactive framework that uses targeted human corrections as interventional signals to steer the model toward robust features. It demonstrates improvements of up to 9 mIoU points (12-15% relative) on challenging data and reduces annotation effort by 3-4x compared to standard retraining.

Segmentation models achieve high accuracy on benchmarks but often fail in real-world domains by relying on spurious correlations instead of true object boundaries. We propose a human-in-the-loop interactive framework that enables interventional learning through targeted human corrections of segmentation outputs. Our approach treats human corrections as interventional signals that show when reliance on superficial features (e.g., color or texture) is inappropriate. The system learns from these interventions by propagating correction-informed edits across visually similar images, effectively steering the model toward robust, semantically meaningful features rather than dataset-specific artifacts. Unlike traditional annotation approaches that simply provide more training data, our method explicitly identifies when and why the model fails and then systematically corrects these failure modes across the entire dataset. Through iterative human feedback, the system develops increasingly robust representations that generalize better to novel domains and resist artifactual correlations. We demonstrate that our framework improves segmentation accuracy by up to 9 mIoU points (12-15\% relative improvement) on challenging cubemap data and yields 3-4$\times$ reductions in annotation effort compared to standard retraining, while maintaining competitive performance on benchmark datasets. This work provides a practical framework for researchers and practitioners seeking to build segmentation systems that are accurate, robust to dataset biases, data-efficient, and adaptable to real-world domains such as urban climate monitoring and autonomous driving.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes