CL AIJan 8, 2024

TeleChat Technical Report

Zhongjiang He, Zihan Wang, Xinzhang Liu, Shixuan Liu, Yitong Yao, Yuyao Huang, Xuelong Li, Yongxiang Li, Zhonghao Che, Zhaoxi Zhang, Yan Wang, Xin Wang

arXiv:2401.03804v210.413 citationsh-index: 7Has Code

Originality Synthesis-oriented

AI Analysis

This provides an incremental open-source LLM option for researchers and developers, with support for English and Chinese tasks.

The authors introduced TeleChat, a collection of large language models in 3B, 7B, and 12B sizes, pretrained on trillions of tokens in English and Chinese and fine-tuned for human alignment, achieving comparable performance to similar-sized open-source models across various benchmarks.

In this technical report, we present TeleChat, a collection of large language models (LLMs) with parameters of 3 billion, 7 billion and 12 billion. It includes pretrained language models as well as fine-tuned chat models that is aligned with human preferences. TeleChat is initially pretrained on an extensive corpus containing a diverse collection of texts from both English and Chinese languages, including trillions of tokens. Subsequently, the model undergoes fine-tuning to align with human preferences, following a detailed methodology that we describe. We evaluate the performance of TeleChat on various tasks, including language understanding, mathematics, reasoning, code generation, and knowledge-based question answering. Our findings indicate that TeleChat achieves comparable performance to other open-source models of similar size across a wide range of public benchmarks. To support future research and applications utilizing LLMs, we release the fine-tuned model checkpoints of TeleChat's 7B and 12B variant, along with code and a portion of our pretraining data, to the public community.

View on arXiv PDF

Similar