CL | LG | Jan 20, 2024

Orion-14B: Open-source Multilingual Large Language Models

arXiv:2401.12246v1 | 6 citations | Has Code
Originality: Incremental advance
AI Analysis

This paper provides an open-source multilingual model family for researchers and practitioners, though the work appears incremental because it builds on existing large language model paradigms.

The study introduced Orion-14B, a collection of multilingual large language models with 14 billion parameters trained on 2.5 trillion tokens, achieving state-of-the-art performance across a broad spectrum of tasks.

In this study, we introduce Orion-14B, a collection of multilingual large language models with 14 billion parameters. We utilize a data scheduling approach to train a foundational model on a diverse corpus of 2.5 trillion tokens, sourced from texts in English, Chinese, Japanese, Korean, and other languages. Additionally, we fine-tune a series of models tailored for conversational applications and other specific use cases. Our evaluation results demonstrate that Orion-14B achieves state-of-the-art performance across a broad spectrum of tasks. We make the Orion-14B model family and its associated code publicly accessible at https://github.com/OrionStarAI/Orion, aiming to inspire future research and practical applications in the field.

Code Implementations: 1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.
