IVCVApr 15

Learning Class Difficulty in Imbalanced Histopathology Segmentation via Dynamic Focal Attention

arXiv:2604.1347921.4h-index: 6
Predicted impact top 70% in IV · last 90 daysOriginality Incremental advance
AI Analysis

For medical image segmentation, DFA offers a simple, efficient alternative to loss reweighting that adaptively captures class difficulty at the representation level, improving performance on imbalanced histopathology datasets.

The paper tackles class imbalance in histopathology image segmentation, where frequency-based loss reweighting fails to capture true difficulty from morphological variability and boundary ambiguity. The proposed Dynamic Focal Attention (DFA) learns class-specific difficulty via a learnable bias in cross-attention, consistently improving Dice and IoU on three benchmarks (BDSA, BCSS, CRAG) without a separate estimator.

Semantic segmentation of histopathology images under class imbalance is typically addressed through frequency-based loss reweighting, which implicitly assumes that rare classes are difficult. However, true difficulty also arises from morphological variability, boundary ambiguity, and contextual similarity-factors that frequency cannot capture. We propose Dynamic Focal Attention (DFA), a simple and efficient mechanism that learns class-specific difficulty directly within the cross-attention of query-based mask decoders. DFA introduces a learnable per-class bias to attention logits, enabling representation-level reweighting prior to prediction rather than gradient-level reweighting after prediction. Initialised from a log-frequency prior to prevent gradient starvation, the bias is optimised end-to-end, allowing the model to adaptively capture difficulty signals through training, effectively unifying frequency-based and difficulty-aware approaches under a common attention-bias framework. On three histopathology benchmarks (BDSA, BCSS, CRAG), DFA consistently improves Dice and IoU, matching or exceeding a difficulty-aware baseline without a separate estimator or additional training stage. These results demonstrate that encoding class difficulty at the representation level provides a principled alternative to conventional loss reweighting for imbalanced segmentation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes