Iria da Cunha

CL
3papers
4citations
Novelty10%
AI Score11

3 Papers

IRMay 1, 2020
Automatic Discourse Segmentation: Review and Perspectives

Iria da Cunha, Juan-Manuel Torres-Moreno

Multilingual discourse parsing is a very prominent research topic. The first stage for discourse parsing is discourse segmentation. The study reported in this article addresses a review of two on-line available discourse segmenters (for English and Portuguese). We evaluate the possibility of developing similar discourse segmenters for Spanish, French and African languages.

CLMar 11, 2017
Extending Automatic Discourse Segmentation for Texts in Spanish to Catalan

Iria da Cunha, Eric SanJuan, Juan-Manuel Torres-Moreno et al.

At present, automatic discourse analysis is a relevant research topic in the field of NLP. However, discourse is one of the phenomena most difficult to process. Although discourse parsers have been already developed for several languages, this tool does not exist for Catalan. In order to implement this kind of parser, the first step is to develop a discourse segmenter. In this article we present the first discourse segmenter for texts in Catalan. This segmenter is based on Rhetorical Structure Theory (RST) for Spanish, and uses lexical and syntactic information to translate rules valid for Spanish into rules for Catalan. We have evaluated the system by using a gold standard corpus including manually segmented texts and results are promising.

CLJan 6, 2015
Un résumeur à base de graphes, indépéndant de la langue

Juan-Manuel Torres-Moreno, Javier Ramirez, Iria da Cunha

In this paper we present REG, a graph-based approach for study a fundamental problem of Natural Language Processing (NLP): the automatic text summarization. The algorithm maps a document as a graph, then it computes the weight of their sentences. We have applied this approach to summarize documents in three languages.