CVAICLFeb 3, 2025

AdaSVD: Adaptive Singular Value Decomposition for Large Language Models

arXiv:2502.01403v415 citationsh-index: 10Has Code
Originality Incremental advance
AI Analysis

This addresses memory constraints for deploying LLMs on devices like mobile or edge systems, offering an incremental improvement over existing SVD compression methods.

The paper tackles the problem of compressing large language models (LLMs) to reduce memory requirements for deployment on resource-constrained devices, proposing AdaSVD, an adaptive SVD-based method that outperforms state-of-the-art SVD techniques with significantly reduced memory overhead.

Large language models (LLMs) have achieved remarkable success in natural language processing (NLP) tasks, yet their substantial memory requirements present significant challenges for deployment on resource-constrained devices. Singular Value Decomposition (SVD) has emerged as a promising compression technique for LLMs, offering considerable reductions in memory overhead. However, existing SVD-based methods often struggle to effectively mitigate the errors introduced by SVD truncation, leading to a noticeable performance gap when compared to the original models. Furthermore, applying a uniform compression ratio across all transformer layers fails to account for the varying importance of different layers. To address these challenges, we propose AdaSVD, an adaptive SVD-based LLM compression approach. Specifically, AdaSVD introduces adaComp, which adaptively compensates for SVD truncation errors by alternately updating the singular matrices $\mathcal{U}$ and $\mathcal{V}^\top$. Additionally, AdaSVD introduces adaCR, which adaptively assigns layer-specific compression ratios based on the relative importance of each layer. Extensive experiments across multiple LLM/VLM families and evaluation metrics demonstrate that AdaSVD consistently outperforms state-of-the-art (SOTA) SVD-based methods, achieving superior performance with significantly reduced memory requirements. Code and models of AdaSVD will be available at https://github.com/ZHITENGLI/AdaSVD.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes