CLFeb 10, 2020

Automatic Discourse Segmentation: an evaluation in French

Rémy Saksik, Alejandro Molina-Villegas, Andréa Carneiro Linhares, Juan-Manuel Torres-Moreno

arXiv:2002.04095v20.21 citations

Originality Synthesis-oriented

AI Analysis

This work addresses discourse segmentation for French language processing, but it is incremental as it applies existing methods to a new language.

The paper tackled the problem of automatic discourse segmentation in French by developing three models based on multilingual resources like marker lists and POS tagging, and evaluated them against the Annodis corpus, achieving encouraging results.

In this article, we describe some discursive segmentation methods as well as a preliminary evaluation of the segmentation quality. Although our experiment were carried for documents in French, we have developed three discursive segmentation models solely based on resources simultaneously available in several languages: marker lists and a statistic POS labeling. We have also carried out automatic evaluations of these systems against the Annodis corpus, which is a manually annotated reference. The results obtained are very encouraging.

View on arXiv PDF

Similar