DCApr 8

Beyond End-to-End: Dynamic Chain Optimization for Private LLM Adaptation on the Edge

arXiv:2604.0681970.91 citationsh-index: 9
Predicted impact top 11% in DC · last 90 daysOriginality Highly original
AI Analysis

This addresses privacy-preserving LLM adaptation for edge computing, offering a novel solution to memory constraints.

The paper tackles the memory bottleneck in federated fine-tuning of LLMs on edge devices by proposing ChainFed, a sequential layer-by-layer training paradigm, which boosts average accuracy by up to 46.46% on multiple benchmarks.

Federated fine-tuning enables privacy-preserving LLM adaptation but faces a critical bottleneck: the disparity between LLMs' high memory demands and edge devices' limited capacity. To break the memory barrier, we propose Chain Federated Fine-Tuning (ChainFed), an innovative paradigm that forgoes end-to-end updates in favor of a sequential, layer-by-layer manner. It first trains the initial adapter to convergence, freezes its weights, and then proceeds to the next. This iterative train-and-freeze process forms an optimization chain, gradually enhancing the model's task-specific proficiency. ChainFed further integrates three core techniques: 1) Dynamic Layer Co-Tuning to bridge semantic gaps between sequentially tuned layers and facilitate information flow; 2) Globally Perceptive Optimization to endow each adapter with foresight beyond its local objective; 3) Function-Oriented Adaptive Tuning to automatically identify the optimal fine-tuning starting point. Extensive experiments on multiple benchmarks demonstrate the superiority of ChainFed over existing methods, boosting average accuracy by up to 46.46\%.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes