CL AIFeb 13, 2025

Adapting Language-Specific LLMs to a Reasoning Model in One Day via Model Merging -- An Open Recipe

Kunat Pipatanakul, Pittawat Taveekitworachai, Potsawee Manakul, Kasima Tharnpipitchai

arXiv:2502.09056v312.06 citationsh-index: 8

Originality Incremental advance

AI Analysis

This addresses the limitation of low-resource languages being underserved by English-centric reasoning models, enabling improved reasoning in language-specific LLMs, though it is incremental as it builds on existing model merging techniques.

The paper tackles the problem of enhancing reasoning capabilities in language-specific LLMs, such as Thai, by adapting models like DeepSeek R1, achieving performance matching DeepSeek R1 on reasoning tasks without compromising target language abilities using publicly available datasets and a $120 computational budget.

This paper investigates data selection and model merging methodologies aimed at incorporating advanced reasoning capabilities such as those of DeepSeek R1 into language-specific large language models (LLMs), with a particular focus on the Thai LLM. Our goal is to enhance the reasoning capabilities of language-specific LLMs while maintaining their target language abilities. DeepSeek R1 excels in reasoning but primarily benefits high-resource languages such as English and Chinese. However, low-resource languages remain underserved due to the dominance of English-centric training data and model optimizations, which limit performance in these languages. This limitation results in unreliable code-switching and diminished effectiveness on tasks in low-resource languages. Meanwhile, local and regional LLM initiatives have attempted to bridge this gap by developing language-specific LLMs that focus on improving local linguistic fidelity. We demonstrate that, with only publicly available datasets and a computational budget of $120, it is possible to enhance the reasoning capabilities of language-specific LLMs to match the level of DeepSeek R1, without compromising their performance on target language tasks.

View on arXiv PDF

Similar