CLJun 27, 2025

MDC-R: The Minecraft Dialogue Corpus with Reference

arXiv:2506.22062v22 citationsh-index: 5
Originality Synthesis-oriented
AI Analysis

This provides a valuable annotated dataset for researchers in natural language processing, particularly for tasks involving reference resolution in situated dialogues, but it is incremental as it builds upon an existing corpus.

The authors introduced MDC-R, a new language resource that adds expert annotations of anaphoric and deictic reference to the existing Minecraft Dialogue Corpus, and demonstrated its usefulness for referring expression comprehension through a short experiment.

We introduce the Minecraft Dialogue Corpus with Reference (MDC-R). MDC-R is a new language resource that supplements the original Minecraft Dialogue Corpus (MDC) with expert annotations of anaphoric and deictic reference. MDC's task-orientated, multi-turn, situated dialogue in a dynamic environment has motivated multiple annotation efforts, owing to the interesting linguistic phenomena that this setting gives rise to. We believe it can serve as a valuable resource when annotated with reference, too. Here, we discuss our method of annotation and the resulting corpus, and provide both a quantitative and a qualitative analysis of the data. Furthermore, we carry out a short experiment demonstrating the usefulness of our corpus for referring expression comprehension.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes