CLMay 14

DiscoExplorer: An Open Interface for the Study of Multilingual Discourse Relations

arXiv:2605.1530497.3Has Code
Predicted impact top 4% in CL · last 90 daysOriginality Synthesis-oriented
AI Analysis

This tool addresses the need for accessible interfaces to analyze complex discourse relation data across languages, benefiting computational linguists and pragmatics researchers.

DiscoExplorer provides an open-source web interface for studying multilingual discourse relations, making datasets from the DISRPT Shared Task covering 16 languages publicly accessible with query, search, and visualization tools.

The relations connecting propositions in discourse such as cause (A because B) or concession (A although B) are a subject of intense interest in Computational Linguistics and Pragmatics, but challenging to study and compare across languages. Recent progress in standardizing discourse relation inventories across datasets offers the potential to facilitate such studies, but is hindered by the complexity of relevant data and the lack of easily accessible interfaces to analyze it. In this paper we present DiscoExplorer, a new open source web interface, capable of running on local computers, which we use to make datasets from the DISRPT Shared Task on discourse relation classification publicly available, covering 16 different languages. We present the query language, search and visualization facilities for relations and signaling devices such as connectives, as well as some example studies.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes