Pablo Ruiz-Ramírez

4.9CLMar 11, 2025

RigoChat 2: an adapted language model to Spanish using a bounded dataset and reduced hardware

Gonzalo Santamaría Gómez, Guillem García Subies, Pablo Gutiérrez Ruiz et al.

Large Language Models (LLMs) have become a key element of modern artificial intelligence, demonstrating the ability to address a wide range of language processing tasks at unprecedented levels of accuracy without the need of collecting problem-specific data. However, these versatile models face a significant challenge: both their training and inference processes require substantial computational resources, time, and memory. Consequently, optimizing this kind of models to minimize these requirements is crucial. In this article, we demonstrate that, with minimal resources and in a remarkably short time, it is possible to enhance a state-of-the-art model, specifically for a given language task, without compromising its overall capabilities using a relatively small pretrained LLM as a basis. Specifically, we present our use case, RigoChat 2, illustrating how LLMs can be adapted to achieve superior results in Spanish-language tasks.

Pablo Ruiz-Ramírez

1 Paper