CVApr 27

Dino-NestedUNet: Unlocking Foundation Vision Encoders for Pathology Tumor Bulk Segmentation via Dense Decoding

arXiv:2605.0089437.8h-index: 7
Predicted impact top 80% in CV · last 90 daysOriginality Incremental advance
AI Analysis

This work addresses the capacity mismatch between frozen foundation encoders and lightweight decoders for boundary-sensitive pathology segmentation, offering a practical improvement for cross-domain generalization.

Dino-NestedUNet couples a DINOv3 encoder with a Nested Dense Decoder to improve boundary fidelity in tumor bulk segmentation. It achieves consistent improvements over UNet++ and Dino-UNet variants across three histopathology cohorts, with zero-shot generalization to unseen datasets.

Vision foundation models (VFMs), such as DINOv3, provide rich semantic representations that are promising for computational pathology. However, many current adaptations pair frozen VFMs with lightweight decoders, creating a capacity mismatch that often limits boundary fidelity for infiltrative tumor bulk segmentation. This paper presents Dino-NestedUNet, a framework that couples a pre-trained DINOv3 encoder with a Nested Dense Decoder. Instead of sparse skip connections and linear upsampling, the proposed decoder forms a dense grid of intermediate pathways to enable continuous feature reuse and multi-scale recalibration, aligning high-level semantics with low-level morphological textures during reconstruction. We evaluate Dino-NestedUNet on three histopathology cohorts (multi-center CHTN, institutional OSU, and CAMELYON16) and observe consistent improvements over UNet++ and standard Dino-UNet variants, particularly under cross-domain shift. To further assess external generalization, we perform zero-shot evaluation by training on CHTN and directly testing on unseen TIGER WSIBULK and OSU CRC cohorts without fine-tuning. These results suggest that dense decoding is a key ingredient for unlocking foundation encoders in boundary-sensitive pathology segmentation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes