CLJun 26, 2023

Fauno: The Italian Large Language Model that will leave you senza parole!

arXiv:2306.14457v120 citationsh-index: 19Has Code
Originality Synthesis-oriented
AI Analysis

This work democratizes the study of LLMs for Italian speakers by providing an open-source conversational model and datasets, though it is incremental as it applies existing methods to a new language.

The authors tackled the lack of open-source Italian conversational large language models by developing Fauno, the first and largest such model, which they fine-tuned on diverse datasets and made accessible with a single GPU, releasing code and datasets publicly.

This paper presents Fauno, the first and largest open-source Italian conversational Large Language Model (LLM). Our goal with Fauno is to democratize the study of LLMs in Italian, demonstrating that obtaining a fine-tuned conversational bot with a single GPU is possible. In addition, we release a collection of datasets for conversational AI in Italian. The datasets on which we fine-tuned Fauno include various topics such as general question answering, computer science, and medical questions. We release our code and datasets on \url{https://github.com/RSTLess-research/Fauno-Italian-LLM}

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes