CLSep 15, 2023

FedJudge: Federated Legal Large Language Model

Linan Yue, Qi Liu, Yichao Du, Weibo Gao, Ye Liu, Fangzhou Yao

arXiv:2309.08173v32.513 citationsh-index: 13Has Code

Originality Incremental advance

AI Analysis

This addresses data privacy concerns for legal professionals and institutions by enabling decentralized training of Legal LLMs, though it is incremental as it adapts existing FL and fine-tuning methods to a specific domain.

The paper tackles the challenge of training Legal Large Language Models (LLMs) with data privacy by integrating them with Federated Learning (FL), proposing FedJudge to fine-tune LLMs efficiently and effectively while mitigating data distribution shifts, with experimental validation on three real-world datasets.

Large Language Models (LLMs) have gained prominence in the field of Legal Intelligence, offering potential applications in assisting legal professionals and laymen. However, the centralized training of these Legal LLMs raises data privacy concerns, as legal data is distributed among various institutions containing sensitive individual information. This paper addresses this challenge by exploring the integration of Legal LLMs with Federated Learning (FL) methodologies. By employing FL, Legal LLMs can be fine-tuned locally on devices or clients, and their parameters are aggregated and distributed on a central server, ensuring data privacy without directly sharing raw data. However, computation and communication overheads hinder the full fine-tuning of LLMs under the FL setting. Moreover, the distribution shift of legal data reduces the effectiveness of FL methods. To this end, in this paper, we propose the first Federated Legal Large Language Model (FedJudge) framework, which fine-tunes Legal LLMs efficiently and effectively. Specifically, FedJudge utilizes parameter-efficient fine-tuning methods to update only a few additional parameters during the FL training. Besides, we explore the continual learning methods to preserve the global model's important parameters when training local clients to mitigate the problem of data shifts. Extensive experimental results on three real-world datasets clearly validate the effectiveness of FedJudge. Code is released at https://github.com/yuelinan/FedJudge.

View on arXiv PDF Code

Similar