Co-creation of AI technology, empowering curators of cultural heritage information and guarding research commons
For curators of cultural heritage information, this work offers a method to enable AI-powered access to their collections, but the approach is incremental and lacks quantitative evaluation.
The paper describes the use of Retrieval-Augmented Generation (RAG) to create local chatbots for specific digital collections of cultural assets, culminating from a journey starting with 'archives for everyone' within the MuseIT project. No concrete performance numbers are provided.
The substance of this paper is the description of the use of Retrieval-Augmented Generation (RAG) for specific digital collections of cultural assets. The collections are provided by institutions operating in the cultural sector. The topical areas are the humanities and social sciences. More concretely, most of the work presented here was enabled by a European-funded research project MuseIT which is clearly situated in the realm of fostering new technologies for Cultural Heritage. We adhere to this interaction by presenting a sequence of our experimentations. This sequence is narrated as a specific journey of engineering all executed around a specific data-sharing and archiving platform Dataverse. Implementing a local chatbot for collections - a method also known as RAG in Information Retrieval - is the current culmination of this journey. The engineering journey we describe in the core of the paper starts from "archives for everyone" and ends with "local chatbots for specific collections".