CLJul 17, 2022

A Spoken Drug Prescription Dataset in French for Spoken Language Understanding

Ali Can Kocabiyikoglu, François Portet, Prudence Gibert, Hervé Blanchon, Jean-Marc Babouchkine, Gaëtan Gavazzi

arXiv:2207.08292v131.0584 citationsh-index: 30

Originality Synthesis-oriented

AI Analysis

This dataset addresses a gap for researchers and developers working on spoken medical dialogue systems, particularly in French, but is incremental as it extends existing text-based resources to a new modality and language.

The authors tackled the lack of speech corpora for medical drug prescription systems by creating PxSLU, the first spoken medical drug prescriptions corpus in French, containing 4 hours of transcribed and annotated dialogues from 55 participants, and demonstrated its utility for developing medical dialogue systems.

Spoken medical dialogue systems are increasingly attracting interest to enhance access to healthcare services and improve quality and traceability of patient care. In this paper, we focus on medical drug prescriptions acquired on smartphones through spoken dialogue. Such systems would facilitate the traceability of care and would free clinicians' time. However, there is a lack of speech corpora to develop such systems since most of the related corpora are in text form and in English. To facilitate the research and development of spoken medical dialogue systems, we present, to the best of our knowledge, the first spoken medical drug prescriptions corpus, named PxSLU. It contains 4 hours of transcribed and annotated dialogues of drug prescriptions in French acquired through an experiment with 55 participants experts and non-experts in prescriptions. We also present some experiments that demonstrate the interest of this corpus for the evaluation and development of medical dialogue systems.

View on arXiv PDF

Similar