CL AIAug 7, 2024

EXAONE 3.0 7.8B Instruction Tuned Language Model

Soyoung An, Kyunghoon Bae, Eunbi Choi, Stanley Jungkyu Choi, Yemuk Choi, Seokhee Hong, Yeonjung Hong, Junwon Hwang, Hyojin Jeon, Gerrard Jeongwon Jo, Hyunjik Jo, Jiyeon Jung

arXiv:2408.03541v411.222 citationsh-index: 18Has Code

Originality Synthesis-oriented

AI Analysis

This provides an open, bilingual language model for researchers and developers, but it is incremental as it builds on existing LLM families with instruction tuning.

The researchers introduced EXAONE 3.0, a 7.8B instruction-tuned language model, tackling the need for competitive open models with bilingual proficiency, achieving strong performance in Korean and general tasks against similar-sized state-of-the-art models.

We introduce EXAONE 3.0 instruction-tuned language model, the first open model in the family of Large Language Models (LLMs) developed by LG AI Research. Among different model sizes, we publicly release the 7.8B instruction-tuned model to promote open research and innovations. Through extensive evaluations across a wide range of public and in-house benchmarks, EXAONE 3.0 demonstrates highly competitive real-world performance with instruction-following capability against other state-of-the-art open models of similar size. Our comparative analysis shows that EXAONE 3.0 excels particularly in Korean, while achieving compelling performance across general tasks and complex reasoning. With its strong real-world effectiveness and bilingual proficiency, we hope that EXAONE keeps contributing to advancements in Expert AI. Our EXAONE 3.0 instruction-tuned model is available at https://huggingface.co/LGAI-EXAONE/EXAONE-3.0-7.8B-Instruct.

View on arXiv PDF

Similar