Evaluating NLG systems: A brief introduction
It addresses the problem of inadequate evaluation practices for NLG researchers, but it is incremental as it primarily provides an introductory overview.
The essay introduces evaluation in Natural Language Generation (NLG), explaining key terms and distinctions to encourage researchers to improve assessment methods, as highlighted by an INLG award for best evaluation.
This year the International Conference on Natural Language Generation (INLG) will feature an award for the paper with the best evaluation. The purpose of this award is to provide an incentive for NLG researchers to pay more attention to the way they assess the output of their systems. This essay provides a short introduction to evaluation in NLG, explaining key terms and distinctions.