CL · Mar 12

Tiny Aya: Bridging Scale and Multilingual Depth

Alejandro R. Salamanca, Diana Abagyan, Daniel D'souza, Ammar Khairi, David Mora, Saurabh Dash, Viraat Aryabumi, Sara Rajaee, Mehrnaz Mofakhami, Ananya Sahu, Thomas Euyang, Brittawnya Prince

Microsoft

arXiv:2603.11510v115.210 citations · h-index: 67

Predicted impact: top 1% in CL · last 90 days · Originality: Highly original

AI Analysis

Tiny Aya offers an efficient, balanced alternative for multilingual AI deployment. It benefits users in diverse regions by pairing a small model size with multilingual depth.

The paper tackles the challenge of building a small multilingual language model that achieves state-of-the-art translation quality and strong multilingual understanding with only 3.35B parameters. The model is trained on 70 languages and refined through region-aware posttraining.

Tiny Aya redefines what a small multilingual language model can achieve. Trained on 70 languages and refined through region-aware posttraining, it delivers state-of-the-art translation quality, strong multilingual understanding, and high-quality target-language generation, all with just 3.35B parameters. The release includes a pretrained foundation model, a globally balanced instruction-tuned variant, and three region-specialized models targeting languages from Africa, South Asia, Europe, Asia-Pacific, and West Asia. This report details the training strategy, data composition, and comprehensive evaluation framework behind Tiny Aya, and presents an alternative scaling path for multilingual AI: one centered on efficiency, balanced performance across languages, and practical deployment.
