LGDCJan 12, 2024

Heterogeneous LoRA for Federated Fine-tuning of On-Device Foundation Models

arXiv:2401.06432v2135 citationsh-index: 15EMNLP
Originality Incremental advance
AI Analysis

This addresses privacy-preserving fine-tuning for on-device foundation models, but it is incremental as it builds on existing LoRA and federated learning techniques.

The paper tackles the data and system heterogeneity problem in federated fine-tuning of on-device foundation models by proposing HetLoRA, a method using heterogeneous low-rank approximations, which achieves improved convergence speed and final performance compared to homogeneous LoRA.

Foundation models (FMs) adapt well to specific domains or tasks with fine-tuning, and federated learning (FL) enables the potential for privacy-preserving fine-tuning of the FMs with on-device local data. For federated fine-tuning of FMs, we consider the FMs with small to medium parameter sizes of single digit billion at maximum, referred to as on-device FMs (ODFMs) that can be deployed on devices for inference but can only be fine-tuned with parameter efficient methods. In our work, we tackle the data and system heterogeneity problem of federated fine-tuning of ODFMs by proposing a novel method using heterogeneous low-rank approximations (LoRAs), namely HetLoRA. First, we show that the naive approach of using homogeneous LoRA ranks across devices face a trade-off between overfitting and slow convergence, and thus propose HetLoRA, which allows heterogeneous ranks across client devices and efficiently aggregates and distributes these heterogeneous LoRA modules. By applying rank self-pruning locally and sparsity-weighted aggregation at the server, HetLoRA combines the advantages of high and low-rank LoRAs, which achieves improved convergence speed and final performance compared to homogeneous LoRA. Furthermore, HetLoRA offers enhanced computation efficiency compared to full fine-tuning, making it suitable for federated fine-tuning across heterogeneous devices.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes