CVOct 25, 2025

Diffusion-Driven Two-Stage Active Learning for Low-Budget Semantic Segmentation

arXiv:2510.22229v1h-index: 11Has Code
Originality Incremental advance
AI Analysis

This addresses the problem of expensive pixel-level annotations for semantic segmentation under constrained labeling budgets, offering a novel method that is incremental in combining existing techniques like diffusion models and entropy-augmented scores.

The paper tackles low-budget active learning for semantic segmentation by proposing a two-stage selection pipeline that decouples diversity and uncertainty, achieving high segmentation accuracy with only a tiny fraction of labeled pixels, as demonstrated by significantly outperforming existing baselines on four benchmarks under extreme pixel-budget regimes.

Semantic segmentation demands dense pixel-level annotations, which can be prohibitively expensive - especially under extremely constrained labeling budgets. In this paper, we address the problem of low-budget active learning for semantic segmentation by proposing a novel two-stage selection pipeline. Our approach leverages a pre-trained diffusion model to extract rich multi-scale features that capture both global structure and fine details. In the first stage, we perform a hierarchical, representation-based candidate selection by first choosing a small subset of representative pixels per image using MaxHerding, and then refining these into a diverse global pool. In the second stage, we compute an entropy-augmented disagreement score (eDALD) over noisy multi-scale diffusion features to capture both epistemic uncertainty and prediction confidence, selecting the most informative pixels for annotation. This decoupling of diversity and uncertainty lets us achieve high segmentation accuracy with only a tiny fraction of labeled pixels. Extensive experiments on four benchmarks (CamVid, ADE-Bed, Cityscapes, and Pascal-Context) demonstrate that our method significantly outperforms existing baselines under extreme pixel-budget regimes. Our code is available at https://github.com/jn-kim/two-stage-edald.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes