CL LGJan 20, 2024

Orion-14B: Open-source Multilingual Large Language Models

Du Chen, Yi Huang, Xiaopu Li, Yongqiang Li, Yongqiang Liu, Haihui Pan, Leichao Xu, Dacheng Zhang, Zhipeng Zhang, Kun Han

arXiv:2401.12246v17.26 citationsHas Code

Originality Incremental advance

AI Analysis

This provides an open-source multilingual model for researchers and practitioners, though it appears incremental as it builds on existing large language model paradigms.

The study introduced Orion-14B, a collection of multilingual large language models with 14 billion parameters trained on 2.5 trillion tokens, achieving state-of-the-art performance across a broad spectrum of tasks.

In this study, we introduce Orion-14B, a collection of multilingual large language models with 14 billion parameters. We utilize a data scheduling approach to train a foundational model on a diverse corpus of 2.5 trillion tokens, sourced from texts in English, Chinese, Japanese, Korean, and other languages. Additionally, we fine-tuned a series of models tailored for conversational applications and other specific use cases. Our evaluation results demonstrate that Orion-14B achieves state-of-the-art performance across a broad spectrum of tasks. We make the Orion-14B model family and its associated code publicly accessible https://github.com/OrionStarAI/Orion, aiming to inspire future research and practical applications in the field.

View on arXiv PDF Code

Similar