CLOct 16, 2023

JMedLoRA:Medical Domain Adaptation on Japanese Large Language Models using Instruction-tuning

Issey Sukeda, Masahiro Suzuki, Hiroki Sakaji, Satoshi Kodera

arXiv:2310.10083v23.315 citationsh-index: 14

Originality Incremental advance

AI Analysis

This work addresses the problem of domain adaptation for medical applications in Japanese, enabling institutions to fine-tune models independently, though it is incremental as it builds on existing LoRA and instruction-tuning methods.

The study tackled the challenge of adapting large language models to the medical domain by applying LoRA-based instruction-tuning to Japanese medical question-answering tasks, finding that it partially incorporates domain-specific knowledge with larger models showing more pronounced effects.

In the ongoing wave of impact driven by large language models (LLMs) like ChatGPT, the adaptation of LLMs to medical domain has emerged as a crucial research frontier. Since mainstream LLMs tend to be designed for general-purpose applications, constructing a medical LLM through domain adaptation is a huge challenge. While instruction-tuning is used to fine-tune some LLMs, its precise roles in domain adaptation remain unknown. Here we show the contribution of LoRA-based instruction-tuning to performance in Japanese medical question-answering tasks. In doing so, we employ a multifaceted evaluation for multiple-choice questions, including scoring based on "Exact match" and "Gestalt distance" in addition to the conventional accuracy. Our findings suggest that LoRA-based instruction-tuning can partially incorporate domain-specific knowledge into LLMs, with larger models demonstrating more pronounced effects. Furthermore, our results underscore the potential of adapting English-centric models for Japanese applications in domain adaptation, while also highlighting the persisting limitations of Japanese-centric models. This initiative represents a pioneering effort in enabling medical institutions to fine-tune and operate models without relying on external services.

View on arXiv PDF

Similar