LG AIFeb 27

FedRot-LoRA: Mitigating Rotational Misalignment in Federated LoRA

Haoran Zhang, Dongjun Kim, Seohyeon Cha, Haris Vikalo

arXiv:2602.23638v1

Originality Highly original

AI Analysis

This addresses a specific bottleneck in federated learning for large language models, offering a communication-efficient solution to improve training stability and accuracy in decentralized settings.

The paper tackled the problem of rotational misalignment in federated LoRA, which causes aggregation errors and unstable training, by proposing FedRot-LoRA to align client updates with orthogonal transformations, resulting in consistent performance improvements over baselines across various heterogeneity levels and LoRA ranks.

Federated LoRA provides a communication-efficient mechanism for fine-tuning large language models on decentralized data. In practice, however, a discrepancy between the factor-wise averaging used to preserve low rank and the mathematically correct aggregation of local updates can cause significant aggregation error and unstable training. We argue that a major source of this problem is rotational misalignment, arising from the rotational invariance of low-rank factorizations -- semantically equivalent updates can be represented in different latent subspaces across clients since $(B_i R_i)(R_i^\top A_i) = B_i A_i$. When such misaligned factors are averaged directly, they interfere destructively and degrade the global update. To address this issue, we propose FedRot-LoRA, a federated LoRA framework that aligns client updates via orthogonal transformations prior to aggregation. This alignment preserves the semantic update while reducing cross-client subspace mismatch, without increasing communication cost or restricting model expressivity. We provide a convergence analysis that examines the aggregation error induced by factor-wise averaging and shows how rotational alignment yields a tighter upper bound on this error. Extensive experiments on natural language understanding and generative tasks demonstrate that FedRot-LoRA consistently outperforms existing federated LoRA baselines across a range of heterogeneity levels and LoRA ranks.

View on arXiv PDF

Similar