CV · Oct 24, 2025

Controllable-LPMoE: Adapting to Challenging Object Segmentation via Dynamic Local Priors from Mixture-of-Experts

Yanguang Sun, Jiawei Lian, Jian Yang, Lei Luo

arXiv:2510.21114v1 · 7 citations · h-index: 6 · Has Code

Originality: Incremental advance

AI Analysis

This work addresses the bottleneck of training efficiency for researchers and practitioners adapting large models to specific segmentation tasks, offering an incremental improvement over existing fine-tuning methods.

The paper tackles the computational inefficiency of fine-tuning large foundation models for object segmentation. It proposes Controllable-LPMoE, a method that combines dynamic local priors with a mixture-of-experts approach to reduce the number of trainable parameters, and reports superior segmentation performance compared to 31 state-of-the-art methods.

Large-scale foundation models provide powerful feature representations for downstream object segmentation tasks. However, when they are adapted to specific tasks through full-parameter fine-tuning, the enormous number of parameters being updated often results in significant computational overhead, creating a bottleneck in training efficiency. Although existing methods attempt to fine-tune frozen models by directly embedding trainable prompts, these prompts lack inherent semantic priors, limiting the adaptability of large-scale models. In this paper, we propose a novel dynamic priors-based fine-tuning paradigm with fewer trainable parameters, dubbed Controllable-LPMoE, which adaptively modulates frozen foundation models by dynamically controlling local priors to enhance fine-grained perception for specific segmentation tasks. More specifically, we construct a lightweight dynamic mixed local priors extractor that captures diverse local priors from input images through heterogeneous convolutions while employing a gating network to dynamically output the expert priors required for the subsequent fine-tuning. Furthermore, we design a bi-directional interaction adapter that employs cosine-aligned deformable attention and channel-oriented adaptive scale enhancement to let frozen and trainable features interact and be restructured, achieving efficient fine-tuning. Extensive experiments validate the superiority of our Controllable-LPMoE approach (https://github.com/CSYSI/Controllable-LPMoE), demonstrating excellent segmentation performance compared to 31 state-of-the-art (SOTA) methods and adaptability to multiple binary object segmentation tasks.
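The gated local-priors idea in the abstract can be illustrated with a small, dependency-free sketch. This is not the authors' implementation. The kernel shapes, the linear gate on the global image mean, the top-k routing, and every name below (`conv2d_same`, `mixed_local_priors`, `gate_params`) are assumptions chosen for illustration. The sketch only shows the general pattern: several convolution "experts" with different receptive fields, and a gate that decides, per input, how their outputs are mixed.

```python
import math
import random


def conv2d_same(image, kernel):
    """Zero-padded 'same' cross-correlation of a 2D list with an odd-sized kernel."""
    h, w = len(image), len(image[0])
    kh, kw = len(kernel), len(kernel[0])
    ph, pw = kh // 2, kw // 2
    out = [[0.0] * w for _ in range(h)]
    for i in range(h):
        for j in range(w):
            acc = 0.0
            for di in range(kh):
                for dj in range(kw):
                    y, x = i + di - ph, j + dj - pw
                    if 0 <= y < h and 0 <= x < w:
                        acc += image[y][x] * kernel[di][dj]
            out[i][j] = acc
    return out


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def make_kernel(kh, kw, rng):
    """Random kernel standing in for learned expert weights (assumption)."""
    return [[rng.uniform(-1, 1) / (kh * kw) for _ in range(kw)] for _ in range(kh)]


def mixed_local_priors(image, kernels, gate_params, top_k=2):
    """Mix expert feature maps with input-dependent gate weights.

    The gate here is a hypothetical linear map of the global mean intensity;
    only the top_k experts by gate probability contribute (an assumption).
    """
    h, w = len(image), len(image[0])
    expert_maps = [conv2d_same(image, k) for k in kernels]
    mean = sum(sum(row) for row in image) / (h * w)
    probs = softmax([a * mean + b for a, b in gate_params])
    top = sorted(range(len(probs)), key=lambda e: probs[e], reverse=True)[:top_k]
    norm = sum(probs[e] for e in top)
    weights = [probs[e] / norm if e in top else 0.0 for e in range(len(probs))]
    prior = [
        [sum(weights[e] * expert_maps[e][i][j] for e in top) for j in range(w)]
        for i in range(h)
    ]
    return prior, weights


rng = random.Random(0)
image = [[rng.random() for _ in range(8)] for _ in range(8)]
# Heterogeneous experts: square and strip-shaped kernels (shapes are assumptions).
kernels = [make_kernel(3, 3, rng), make_kernel(5, 5, rng),
           make_kernel(1, 7, rng), make_kernel(7, 1, rng)]
gate_params = [(rng.uniform(-1, 1), rng.uniform(-1, 1)) for _ in kernels]
prior, weights = mixed_local_priors(image, kernels, gate_params, top_k=2)
```

In this toy version the resulting `prior` has the same spatial size as the input, and the gate weights over the selected experts sum to one. In a fine-tuning setup along the lines the abstract describes, only the experts and the gate would be trainable while the foundation model stays frozen; how the paper actually injects the priors into the frozen backbone is not shown here.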
