LG May 29, 2025

Weight Spectra Induced Efficient Model Adaptation

Chongjie Si, Xuankun Yang, Muqing Liu, Yadao Wang, Xiaokang Yang, Wenbo Su, Bo Zheng, Wei Shen

arXiv:2505.23099v111.43 citations, h-index: 13

Originality: Incremental advance

AI Analysis

This work addresses the computational cost of fine-tuning large foundation models, a concern for researchers and practitioners alike, but the advance is incremental because it builds on existing PEFT methods such as LoRA.

The paper examines how Parameter-Efficient Fine-Tuning (PEFT) modifies model parameters by analyzing structural changes in weight matrices during fine-tuning. The analysis indicates that task-specific knowledge is injected into a low-dimensional subspace, and the authors propose a method that improves on baselines across multiple tasks.

Large-scale foundation models have demonstrated remarkable versatility across a wide range of downstream tasks. However, full fine-tuning of these models incurs prohibitive computational costs, motivating the development of Parameter-Efficient Fine-Tuning (PEFT) methods such as LoRA, which introduces low-rank updates to pre-trained weights. Despite their empirical success, the underlying mechanisms by which PEFT modifies model parameters remain underexplored. In this work, we present a systematic investigation into the structural changes of weight matrices during full fine-tuning. Through singular value decomposition (SVD), we reveal that fine-tuning predominantly amplifies the top singular values while leaving the remainder largely intact, suggesting that task-specific knowledge is injected into a low-dimensional subspace. Furthermore, we find that the dominant singular vectors are reoriented in task-specific directions, whereas the non-dominant subspace remains stable. Building on these insights, we propose a novel method that leverages learnable rescaling of top singular directions, enabling precise modulation of the most influential components without disrupting the global structure. Our approach achieves consistent improvements over strong baselines across multiple tasks, highlighting the efficacy of structurally informed fine-tuning.
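The two ideas in the abstract, comparing singular-value spectra before and after adaptation and rescaling only the top singular directions, can be illustrated with a small NumPy sketch. This is not the authors' implementation. The function names `spectrum_shift` and `rescale_top_k`, the matrix size, and the fixed `scales` vector are all assumptions made for illustration. In an actual training setup the scales would presumably be learnable parameters, and the abstract does not say whether the singular vectors are also updated.

```python
import numpy as np


def spectrum_shift(w_pre, w_ft):
    """Relative change of each singular value between two weight matrices.

    Hypothetical helper: compares the sorted spectra index by index.
    """
    s_pre = np.linalg.svd(w_pre, compute_uv=False)
    s_ft = np.linalg.svd(w_ft, compute_uv=False)
    return (s_ft - s_pre) / s_pre


def rescale_top_k(w, scales):
    """Multiply the top-k singular values of `w` by `scales` (length k).

    Singular vectors and the remaining singular values are left untouched.
    """
    u, s, vt = np.linalg.svd(w, full_matrices=False)
    k = len(scales)
    s_new = s.copy()
    s_new[:k] = s[:k] * scales
    # (u * s_new) scales column j of u by s_new[j], i.e. u @ diag(s_new).
    return (u * s_new) @ vt


# Toy stand-in for a pre-trained weight matrix (random, for illustration only).
rng = np.random.default_rng(0)
w = rng.standard_normal((64, 32))

# Fixed stand-ins for learned scales on the top four directions (assumed values).
scales = np.array([1.5, 1.2, 1.1, 1.0])
w_adapted = rescale_top_k(w, scales)

# Relative change is about 0.5, 0.2, 0.1 on the first three singular values
# and about 0 on all the others.
rel = spectrum_shift(w, w_adapted)
print(np.round(rel[:6], 3))
```

Only `len(scales)` numbers are adjusted per weight matrix in this sketch, which is what makes the scheme parameter-efficient. The scales here are non-increasing and at least 1, so the ordering of the singular values is preserved and the index-by-index comparison in `spectrum_shift` stays meaningful.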
