CLFeb 10, 2020

Automatic Discourse Segmentation: an evaluation in French

arXiv:2002.04095v21 citations
AI Analysis

This work addresses discourse segmentation for French language processing, but it is incremental as it applies existing methods to a new language.

The paper tackled the problem of automatic discourse segmentation in French by developing three models based on multilingual resources like marker lists and POS tagging, and evaluated them against the Annodis corpus, achieving encouraging results.

In this article, we describe some discursive segmentation methods as well as a preliminary evaluation of the segmentation quality. Although our experiment were carried for documents in French, we have developed three discursive segmentation models solely based on resources simultaneously available in several languages: marker lists and a statistic POS labeling. We have also carried out automatic evaluations of these systems against the Annodis corpus, which is a manually annotated reference. The results obtained are very encouraging.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes