CLMar 5, 2024

Breeze-7B Technical Report

arXiv:2403.02712v24 citationsh-index: 18Has Code
Originality Synthesis-oriented
AI Analysis

This work addresses the problem of limited Traditional Chinese language models for chatbot applications, but it is incremental as it builds on an existing model.

The researchers tackled the need for better language comprehension and chatbot capabilities in Traditional Chinese by developing Breeze-7B, an open-source model based on Mistral-7B, which achieved top performance in several benchmarks for its complexity class.

Breeze-7B is an open-source language model based on Mistral-7B, designed to address the need for improved language comprehension and chatbot-oriented capabilities in Traditional Chinese. This technical report provides an overview of the additional pretraining, finetuning, and evaluation stages for the Breeze-7B model. The Breeze-7B family of base and chat models exhibits good performance on language comprehension and chatbot-oriented tasks, reaching the top in several benchmarks among models comparable in its complexity class.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes