CV | Dec 11, 2025

Take a Peek: Efficient Encoder Adaptation for Few-Shot Semantic Segmentation via LoRA

arXiv:2512.10521v1 | 1 citation | h-index: 21 | Has Code
Originality: Incremental advance
AI Analysis

This work addresses a critical bottleneck in few-shot semantic segmentation for computer vision applications, offering an incremental improvement by enhancing encoder adaptability with minimal computational overhead.

The paper targets a specific weakness in few-shot semantic segmentation: the encoder's limited ability to extract features for unseen classes. It introduces TaP, a method that uses LoRA to fine-tune the encoder on the support set, and reports improved segmentation performance across multiple benchmarks and shot settings.

Few-shot semantic segmentation (FSS) aims to segment novel classes in query images using only a small annotated support set. While prior research has mainly focused on improving decoders, the encoder's limited ability to extract meaningful features for unseen classes remains a key bottleneck. In this work, we introduce Take a Peek (TaP), a simple yet effective method that enhances encoder adaptability for both FSS and cross-domain FSS (CD-FSS). TaP leverages Low-Rank Adaptation (LoRA) to fine-tune the encoder on the support set with minimal computational overhead, enabling fast adaptation to novel classes while mitigating catastrophic forgetting. Our method is model-agnostic and can be seamlessly integrated into existing FSS pipelines. Extensive experiments across multiple benchmarks--including COCO $20^i$, Pascal $5^i$, and cross-domain datasets such as DeepGlobe, ISIC, and Chest X-ray--demonstrate that TaP consistently improves segmentation performance across diverse models and shot settings. Notably, TaP delivers significant gains in complex multi-class scenarios, highlighting its practical effectiveness in realistic settings. A rank sensitivity analysis also shows that strong performance can be achieved even with low-rank adaptations, ensuring computational efficiency. By addressing a critical limitation in FSS--the encoder's generalization to novel classes--TaP paves the way toward more robust, efficient, and generalizable segmentation systems. The code is available at https://github.com/pasqualedem/TakeAPeek.
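To make the low-rank idea in the abstract concrete, here is a minimal sketch of a single LoRA-adapted linear map in plain Python. It is not taken from the TaP repository: the names `LoRALinear`, `matvec`, `lora_params`, and `full_params`, the `alpha / rank` scaling, and the initialization scheme follow the common LoRA formulation and are assumptions for illustration. No training loop is shown; in a TaP-style setup, only `A` and `B` would be updated on the support set while `W` stays frozen.

```python
import random


def matvec(M, v):
    """Multiply a matrix (list of rows) by a vector (list of floats)."""
    return [sum(m * x for m, x in zip(row, v)) for row in M]


class LoRALinear:
    """Frozen linear map W plus a low-rank update (alpha / rank) * B @ A.

    Hypothetical illustration class, not the authors' implementation.
    """

    def __init__(self, W, rank, alpha=1.0, seed=0):
        rng = random.Random(seed)
        d_out, d_in = len(W), len(W[0])
        self.W = W  # pretrained weights, kept frozen
        self.scale = alpha / rank
        # A gets small random values and B starts at zero, so the adapted
        # layer initially reproduces the pretrained layer exactly.
        self.A = [[rng.gauss(0.0, 0.01) for _ in range(d_in)] for _ in range(rank)]
        self.B = [[0.0] * rank for _ in range(d_out)]

    def forward(self, x):
        base = matvec(self.W, x)                     # frozen path
        update = matvec(self.B, matvec(self.A, x))   # low-rank path
        return [b + self.scale * u for b, u in zip(base, update)]


def lora_params(d_out, d_in, rank):
    """Number of trainable values in A (rank x d_in) and B (d_out x rank)."""
    return rank * (d_in + d_out)


def full_params(d_out, d_in):
    """Number of values updated when fine-tuning W directly."""
    return d_out * d_in


W = [[1.0, 2.0, 0.0, 0.0],
     [0.0, 1.0, 0.0, 3.0],
     [0.5, 0.0, 1.0, 0.0]]
layer = LoRALinear(W, rank=1)
x = [1.0, 1.0, 1.0, 1.0]
print(layer.forward(x))                              # equals W @ x before any training
print(lora_params(768, 768, 4), full_params(768, 768))
```

Because `B` starts at zero, the adapted layer behaves exactly like the pretrained one until it is trained, which is one reason a low-rank update is a gentle way to adapt an encoder. The parameter counts show the efficiency argument: for an assumed 768 x 768 projection at rank 4, the adapter trains 6,144 values instead of 589,824.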

Code Implementations: 1 repo