CL · LG · Mar 21, 2024

RakutenAI-7B: Extending Large Language Models for Japanese

arXiv:2403.15484v1 · 15 citations · h-index: 12
Originality: Synthesis-oriented
AI Analysis

This work addresses the need for high-performing Japanese language models, though its contribution is incremental: it extends existing methods to a specific domain.

The authors set out to develop Japanese-oriented large language models. The resulting RakutenAI-7B achieves the best performance on the Japanese LM Harness benchmarks among open 7B models.

We introduce RakutenAI-7B, a suite of Japanese-oriented large language models that achieve the best performance on the Japanese LM Harness benchmarks among the open 7B models. Along with the foundation model, we release instruction- and chat-tuned models, RakutenAI-7B-instruct and RakutenAI-7B-chat respectively, under the Apache 2.0 license.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.
