CVDec 31, 2025

Collaborative Low-Rank Adaptation for Pre-Trained Vision Transformers

arXiv:2512.24603v1h-index: 1
Originality Incremental advance
AI Analysis

This work addresses a specific bottleneck in fine-tuning pre-trained vision transformers for downstream tasks, offering an incremental improvement in parameter efficiency and performance.

The paper tackles the problem of balancing fine-tuning performance and parameter efficiency in low-rank adaptation for vision transformers, proposing CLoRA, which achieves better balance and requires the fewest GFLOPs for point cloud analysis compared to state-of-the-art methods.

Low-rank adaptation (LoRA) has achieved remarkable success in fine-tuning pre-trained vision transformers for various downstream tasks. Existing studies mainly focus on exploring more parameter-efficient strategies or more effective representation learning schemes. However, these methods either sacrifice fine-tuning performance or introduce excessive trainable parameters, failing to strike a balance between learning performance and parameter efficiency. To address this problem, we propose a novel tuning method named collaborative low-rank adaptation (CLoRA) in this paper. CLoRA consists of base-space sharing and sample-agnostic diversity enhancement (SADE) components. To maintain parameter efficiency while expanding the learning capacity of low-rank modules (LRMs), base-space sharing allows all LRMs to share a set of down/up-projection spaces. In CLoRA, the low-rank matrices obtained from the shared spaces collaboratively construct each LRM. Since the representations extracted by these matrices may contain redundant information, SADE is employed to regularize the similarities among them to encourage diverse representations in the training process. We conduct extensive experiments on widely used image and point cloud datasets to evaluate the performance of CLoRA. Experimental results demonstrate that CLoRA strikes a better balance between learning performance and parameter efficiency, while requiring the fewest GFLOPs for point cloud analysis, compared with the state-of-the-art methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes