LGAIApr 30

BoostLoRA: Growing Effective Rank by Boosting Adapters

arXiv:2604.2730878.9
AI Analysis

For practitioners of parameter-efficient fine-tuning, BoostLoRA overcomes the expressivity ceiling of ultra-low-parameter adapters without increasing inference cost.

BoostLoRA introduces a gradient-boosting framework for parameter-efficient fine-tuning that iteratively trains and merges minimal adapters on misclassified examples, enabling effective rank to grow linearly with rounds while maintaining zero inference overhead. On Qwen2.5-3B, it achieves 89.1% on GSM8K and 68.8% on MATH-500, surpassing both TinyLoRA and full fine-tuning.

Parameter-efficient fine-tuning (PEFT) methods face a tradeoff between adapter size and expressivity: ultra-low-parameter adapters are confined to fixed low-rank subspaces, capping performance even with extended training. We propose BoostLoRA, a gradient-boosting framework that overcomes this limit by iteratively training and merging minimal adapters on the examples the current model gets wrong. A ROTATE SVD basis strategy assigns each round to an orthogonal subspace, so cumulative effective rank grows linearly with the number of rounds while each adapter remains ultra-low-rank. After merging, adapters are discarded, leaving zero inference overhead. On Qwen2.5-3B, BoostLoRA reaches 89.1% on GSM8K and 68.8% on MATH-500, surpassing both the best single-shot ultra-low parameter adapter (TinyLoRA) and full fine-tuning; on code generation it reaches 57.2% on MBPP and 80.4% on HumanEval while full fine-tuning drops below the zero-shot baseline. We also demonstrate cross-architecture transfer on protein binding classification with ESM2-650M and cross-entropy training. BoostLoRA is, to our knowledge, the first PEFT method whose effective rank grows with training, separating per-round parameter cost from total representational capacity.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes