CVMar 23, 2025

FisherTune: Fisher-Guided Robust Tuning of Vision Foundation Models for Domain Generalized Segmentation

Dong Zhao, Jinlong Li, Shuang Wang, Mengyao Wu, Qi Zang, Nicu Sebe, Zhun Zhong

arXiv:2503.17940v217 citationsh-index: 30CVPR

Originality Incremental advance

AI Analysis

This addresses the problem of domain generalization in semantic segmentation for computer vision applications, representing an incremental improvement over prior fine-tuning approaches.

The paper tackles the challenge of fine-tuning Vision Foundation Models for Domain Generalized Semantic Segmentation while preserving generalization, proposing FisherTune, which uses Domain-Related Fisher Information Matrix to guide selective parameter updates, achieving superior cross-domain segmentation performance compared to existing methods.

Vision Foundation Models (VFMs) excel in generalization due to large-scale pretraining, but fine-tuning them for Domain Generalized Semantic Segmentation (DGSS) while maintaining this ability remains challenging. Existing approaches either selectively fine-tune parameters or freeze the VFMs and update only the adapters, both of which may underutilize the VFMs' full potential in DGSS tasks. We observe that domain-sensitive parameters in VFMs, arising from task and distribution differences, can hinder generalization. To address this, we propose \textbf{FisherTune}, a robust fine-tuning method guided by the Domain-Related Fisher Information Matrix (DR-FIM). DR-FIM measures parameter sensitivity across tasks and domains, enabling selective updates that preserve generalization and enhance DGSS adaptability. FisherTune incorporates variational inference to stabilize DR-FIM estimation, treating parameters as Gaussian-distributed variables and leveraging pre-trained priors. Extensive experiments show that FisherTune achieves superior cross-domain segmentation while maintaining generalization, outperforming selective-parameter and adapter-based methods.

View on arXiv PDF

Similar