Fauno: The Italian Large Language Model that will leave you senza parole!
This work democratizes the study of LLMs for Italian speakers by providing an open-source conversational model and datasets, though the contribution is incremental, since it applies existing methods to a new language.
The authors address the lack of open-source Italian conversational large language models by developing Fauno, the first and largest such model. They fine-tuned it on diverse datasets, showed that this fine-tuning is feasible on a single GPU, and released the code and datasets publicly.
This paper presents Fauno, the first and largest open-source Italian conversational Large Language Model (LLM). Our goal with Fauno is to democratize the study of LLMs in Italian by demonstrating that it is possible to obtain a fine-tuned conversational bot with a single GPU. In addition, we release a collection of datasets for conversational AI in Italian. The datasets on which we fine-tuned Fauno cover topics such as general question answering, computer science, and medical questions. We release our code and datasets at \url{https://github.com/RSTLess-research/Fauno-Italian-LLM}.