CL LGJan 26, 2024

A Korean Legal Judgment Prediction Dataset for Insurance Disputes

Alice Saebom Kwak, Cheonkam Jeong, Ji Weon Lim, Byeongcheol Min

arXiv:2401.14654v11.02 citations

Originality Synthesis-oriented

AI Analysis

This work addresses a data scarcity problem for insurance companies and customers in Korea, but it is incremental as it applies an existing method to a new domain.

The paper tackles the lack of data for Korean legal judgment prediction in insurance disputes by introducing a new dataset and showing that Sentence Transformer Fine-tuning (SetFit) achieves performance similar to benchmark models with much less data.

This paper introduces a Korean legal judgment prediction (LJP) dataset for insurance disputes. Successful LJP models on insurance disputes can benefit insurance companies and their customers. It can save both sides' time and money by allowing them to predict how the result would come out if they proceed to the dispute mediation process. As is often the case with low-resource languages, there is a limitation on the amount of data available for this specific task. To mitigate this issue, we investigate how one can achieve a good performance despite the limitation in data. In our experiment, we demonstrate that Sentence Transformer Fine-tuning (SetFit, Tunstall et al., 2022) is a good alternative to standard fine-tuning when training data are limited. The models fine-tuned with the SetFit approach on our data show similar performance to the Korean LJP benchmark models (Hwang et al., 2022) despite the much smaller data size.

View on arXiv PDF

Similar