CLJun 22, 2020

Shared Task on Evaluating Accuracy in Natural Language Generation

arXiv:2006.12234v2
Originality Synthesis-oriented
AI Analysis

This addresses the need for standardized evaluation methods in NLG for researchers and practitioners, but it is incremental as it builds on existing shared task frameworks.

The paper introduces a shared task focused on evaluating the accuracy of natural language generation (NLG) systems, specifically using basketball game summaries generated from box score data.

We propose a shared task on methodologies and algorithms for evaluating the accuracy of generated texts. Participants will measure the accuracy of basketball game summaries produced by NLG systems from basketball box score data.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes