CV AIJun 1

Parameter-Efficient Fine-Tuning of Large Pretrained Models for Instance Segmentation Tasks

Nermeen Abou Baker, David Rohrschneider, Uwe Handmann

arXiv:2606.019477.510 citations

Predicted impact top 85% in CV · last 90 daysOriginality Synthesis-oriented

AI Analysis

For practitioners needing efficient fine-tuning of large models for instance segmentation, this paper provides a systematic evaluation of PEFT methods, though the findings are incremental and domain-specific.

This work investigates parameter-efficient fine-tuning (PEFT) methods (adapters and LoRA) for transformer-based instance segmentation, achieving competitive performance while fine-tuning only 1-6% of parameters compared to 40-55% in traditional fine-tuning. Key findings include optimal use of 2-3 adapters per transformer block and LoRA's strong performance when applied to deformable attention.

Research and applications in artificial intelligence have recently shifted with the rise of large pretrained models, which deliver state-of-the-art results across numerous tasks. However, the substantial increase in parameters introduces a need for parameter-efficient training strategies. Despite significant advancements, limited research has explored parameter-efficient fine-tuning (PEFT) methods in the context of transformer-based models for instance segmentation. Addressing this gap, this study investigates the effectiveness of PEFT methods, specifically adapters and Low-Rank Adaptation (LoRA), applied to two models across four benchmark datasets. Integrating sequentially arranged adapter modules and applying LoRA to deformable attention--explored here for the first time--achieves competitive performance while fine-tuning only about 1-6% of model parameters, a marked improvement over the 40-55% required in traditional fine-tuning. Key findings indicate that using 2-3 adapters per transformer block offers an optimal balance of performance and efficiency. Furthermore, LoRA, exhibits strong parameter efficiency when applied to deformable attention, and in certain cases surpasses adapter configurations. These results show that the impact of PEFT techniques varies based on dataset complexity and model architecture, underscoring the importance of context-specific tuning. Overall, this work demonstrates the potential of PEFT to enable scalable, customizable, and computationally efficient transfer learning for instance segmentation tasks.

View on arXiv PDF

Similar