CL, Jan 5, 2024

Introducing Bode: A Fine-Tuned Large Language Model for Portuguese Prompt-Based Task

arXiv:2401.02909v1, 22 citations, h-index: 13
Originality: Synthesis-oriented
AI Analysis

This work provides a free, improved LLM for Portuguese NLP tasks, addressing a gap for a low-resource language, but the contribution is incremental because it builds on existing fine-tuning methods.

The authors tackled the problem of large language models underperforming on Portuguese prompts by fine-tuning LLaMA 2 to create Bode, a model available in 7B and 13B versions, which showed satisfactory results in zero-shot classification tasks compared to other LLMs.

Large Language Models (LLMs) are increasingly bringing advances to Natural Language Processing. However, low-resource languages such as Portuguese, which are underrepresented in datasets for many NLP tasks or whose existing datasets are comparatively small, benefit from LLMs but not to the same extent. LLMs trained on multilingual datasets often struggle to respond satisfactorily to prompts in Portuguese, exhibiting, for example, code-switching in their responses. This work proposes Bode, a LLaMA 2-based model fine-tuned for Portuguese prompts, in two versions: 7B and 13B. We evaluate the model on classification tasks using a zero-shot approach with in-context learning and compare it with other LLMs. Our main contribution is an LLM that achieves satisfactory results in Portuguese and is free for research or commercial use.
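
To make the zero-shot classification setup concrete, the following is a minimal sketch of how a Portuguese instruction prompt might be built and how a free-text reply might be mapped back to a class label. The label set, the prompt wording, and the function names `build_zero_shot_prompt` and `parse_label` are illustrative assumptions, not the prompts or code used by the authors, and no model is called here.

```python
from typing import Optional, Sequence

# Hypothetical label set for a sentiment task; not taken from the paper.
LABELS = ("positivo", "negativo", "neutro")


def build_zero_shot_prompt(text: str, labels: Sequence[str] = LABELS) -> str:
    """Build a Portuguese instruction prompt with no worked examples (zero-shot)."""
    options = ", ".join(labels)
    return (
        "Classifique o sentimento do texto a seguir. "
        f"Responda apenas com uma das opções: {options}.\n\n"
        f"Texto: {text}\n"
        "Resposta:"
    )


def parse_label(response: str, labels: Sequence[str] = LABELS) -> Optional[str]:
    """Return the label that appears earliest in the response, or None if none appears."""
    lowered = response.lower()
    hits = [(lowered.find(label), label) for label in labels if label in lowered]
    return min(hits)[1] if hits else None


if __name__ == "__main__":
    print(build_zero_shot_prompt("Adorei o filme!"))
    # A reply that contains none of the Portuguese labels (for example, one
    # that code-switches to English) is treated as unparseable.
    print(parse_label("The sentiment is positive."))
```

Under this sketch, a reply that drifts into another language yields no label, which is one simple way the code-switching problem described above would show up in a classification evaluation.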

