CLJun 22, 2020

Shared Task on Evaluating Accuracy in Natural Language Generation

arXiv:2006.12234v20.2

Originality Synthesis-oriented

AI Analysis

This addresses the need for standardized evaluation methods in NLG for researchers and practitioners, but it is incremental as it builds on existing shared task frameworks.

The paper introduces a shared task focused on evaluating the accuracy of natural language generation (NLG) systems, specifically using basketball game summaries generated from box score data.

We propose a shared task on methodologies and algorithms for evaluating the accuracy of generated texts. Participants will measure the accuracy of basketball game summaries produced by NLG systems from basketball box score data.

View on arXiv PDF

Similar