CVFeb 4

Mitigating Long-Tail Bias via Prompt-Controlled Diffusion Augmentation

arXiv:2602.04749v1h-index: 14Has Code
AI Analysis

This addresses data imbalance in remote-sensing segmentation for urban mapping and land-cover monitoring, representing an incremental advance in controllable augmentation methods.

The paper tackles long-tailed pixel imbalance in remote-sensing semantic segmentation by introducing a prompt-controlled diffusion augmentation framework that synthesizes paired label-image samples with control over domain and class ratios, achieving consistent improvements across segmentation backbones with gains focused on minority classes and enhanced Urban and Rural generalization.

Semantic segmentation of high-resolution remote-sensing imagery is critical for urban mapping and land-cover monitoring, yet training data typically exhibits severe long-tailed pixel imbalance. In the dataset LoveDA, this challenge is compounded by an explicit Urban/Rural split with distinct appearance and inconsistent class-frequency statistics across domains. We present a prompt-controlled diffusion augmentation framework that synthesizes paired label--image samples with explicit control of both domain and semantic composition. Stage~A uses a domain-aware, masked ratio-conditioned discrete diffusion model to generate layouts that satisfy user-specified class-ratio targets while respecting learned co-occurrence structure. Stage~B translates layouts into photorealistic, domain-consistent images using Stable Diffusion with ControlNet guidance. Mixing the resulting ratio and domain-controlled synthetic pairs with real data yields consistent improvements across multiple segmentation backbones, with gains concentrated on minority classes and improved Urban and Rural generalization, demonstrating controllable augmentation as a practical mechanism to mitigate long-tail bias in remote-sensing segmentation. Source codes, pretrained models, and synthetic datasets are available at \href{https://github.com/Buddhi19/SyntheticGen.git}{Github}

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes