CL LGFeb 3, 2020

CoTK: An Open-Source Toolkit for Fast Development and Fair Evaluation of Text Generation

Fei Huang, Dazhen Wan, Zhihong Shao, Pei Ke, Jian Guan, Yilin Niu, Xiaoyan Zhu, Minlie Huang

arXiv:2002.00583v10.73 citationsHas Code

Originality Synthesis-oriented

AI Analysis

This toolkit addresses practical issues in text generation research for developers and researchers, offering a standardized approach to reduce human errors and ensure fair comparisons, though it is incremental as it builds on existing evaluation methods.

The authors tackled the problem of inconsistent experimental settings and metric implementations in text generation evaluation, which leads to unfair assessments, by developing CoTK, an open-source toolkit that standardizes development steps and provides metric implementations to support fast development and fair evaluation.

In text generation evaluation, many practical issues, such as inconsistent experimental settings and metric implementations, are often ignored but lead to unfair evaluation and untenable conclusions. We present CoTK, an open-source toolkit aiming to support fast development and fair evaluation of text generation. In model development, CoTK helps handle the cumbersome issues, such as data processing, metric implementation, and reproduction. It standardizes the development steps and reduces human errors which may lead to inconsistent experimental settings. In model evaluation, CoTK provides implementation for many commonly used metrics and benchmark models across different experimental settings. As a unique feature, CoTK can signify when and which metric cannot be fairly compared. We demonstrate that it is convenient to use CoTK for model development and evaluation, particularly across different experimental settings.

View on arXiv PDF Code

Similar