LG CRJan 16

Differentially Private Subspace Fine-Tuning for Large Language Models

Lele Zheng, Xiang Wang, Tao Zhang, Yang Cao, Ke Cheng, Yulong Shen

arXiv:2601.11113v1h-index: 6

Originality Incremental advance

AI Analysis

This addresses privacy concerns in fine-tuning large language models for downstream tasks, offering a more efficient approach for applications using sensitive data, though it is incremental as it builds on existing DP fine-tuning methods.

The paper tackles the problem of fine-tuning large language models on sensitive data with differential privacy, which often degrades performance due to high-dimensional noise, and proposes a subspace-based method that reduces noise impact, resulting in improved accuracy, stability, and convergence under privacy constraints.

Fine-tuning large language models on downstream tasks is crucial for realizing their cross-domain potential but often relies on sensitive data, raising privacy concerns. Differential privacy (DP) offers rigorous privacy guarantees and has been widely adopted in fine-tuning; however, naively injecting noise across the high-dimensional parameter space creates perturbations with large norms, degrading performance and destabilizing training. To address this issue, we propose DP-SFT, a two-stage subspace fine-tuning method that substantially reduces noise magnitude while preserving formal DP guarantees. Our intuition is that, during fine-tuning, significant parameter updates lie within a low-dimensional, task-specific subspace, while other directions change minimally. Hence, we only inject DP noise into this subspace to protect privacy without perturbing irrelevant parameters. In phase one, we identify the subspace by analyzing principal gradient directions to capture task-specific update signals. In phase two, we project full gradients onto this subspace, add DP noise, and map the perturbed gradients back to the original parameter space for model updates, markedly lowering noise impact. Experiments on multiple datasets demonstrate that DP-SFT enhances accuracy and stability under rigorous DP constraints, accelerates convergence, and achieves substantial gains over DP fine-tuning baselines.

View on arXiv PDF

Similar