Elena Irimia

30.8 | CL | Jun 16, 2022

An Open-Domain QA System for e-Governance

Radu Ion, Andrei-Marius Avram, Vasile Păiş et al.

The paper presents an open-domain Question Answering (QA) system for Romanian that answers COVID-19-related questions. The QA pipeline comprises automatic question processing, automatic query generation, a web search for the 10 most relevant documents, and answer extraction with a BERT model fine-tuned for extractive QA on a COVID-19 data set that we created manually. The paper describes the QA system, its integration with the Romanian language technologies portal RELATE, the COVID-19 data set, and several evaluations of QA performance.

0.7 | CL | Nov 22, 2021

Human-Machine Interaction Speech Corpus from the ROBIN project

Vasile Păiş, Radu Ion, Andrei-Marius Avram et al.

This paper introduces a new Romanian speech corpus from the ROBIN project, called the ROBIN Technical Acquisition Speech Corpus (ROBINTASC). Its main purpose is to improve the behaviour of a conversational agent that supports human-machine interaction in the context of purchasing technical equipment. The paper contains a detailed description of the acquisition process, corpus statistics, and an evaluation of the corpus's influence on a low-latency ASR system and on a dialogue component.
