JIANG: Chinese Open Foundation Language Model
This work addresses the suboptimal Chinese-language capabilities of AI models for users and developers in Chinese-speaking domains, though the contribution is incremental, since it adapts existing methods to a new language focus.
The authors tackle the limited Chinese performance of existing large language models by introducing JIANG, a model designed specifically for Chinese, and they report excellent performance in their experiments.
As large language model technology has advanced, these models have demonstrated capabilities that approach human performance across a variety of tasks. This has attracted significant interest from companies and scientific research institutions, leading to substantial investment in their research and development. Although numerous large models have emerged during this period, most have been trained primarily on English data. They perform reasonably well in other languages, such as Chinese, but factors such as vocabulary design and the composition of the training corpus limit their potential, so they cannot fully express their capabilities in Chinese. To address this issue, we introduce JIANG (the Chinese pinyin for "ginger"), a model designed specifically for the Chinese language. We gathered a substantial Chinese corpus to train the model and also optimized its structure. Extensive experimental results demonstrate the excellent performance of our model.