CL LGJun 1, 2020

An Effectiveness Metric for Ordinal Classification: Formal Properties and Experimental Results

Enrique Amigó, Julio Gonzalo, Stefano Mizzaro, Jorge Carrillo-de-Albornoz

arXiv:2006.01245v131.11000 citationsHas Code

Originality Incremental advance

AI Analysis

This work addresses a methodological gap for researchers and practitioners in fields like sentiment analysis by providing a more appropriate evaluation metric for ordinal classification, though it is incremental as it builds on existing measurement theory.

The paper tackles the problem of evaluating ordinal classification tasks, where existing metrics either ignore class ordering or assume absolute distances, by proposing a new metric called Closeness Evaluation Measure based on Measurement and Information Theory, with experimental results showing it captures quality aspects from different traditional tasks.

In Ordinal Classification tasks, items have to be assigned to classes that have a relative ordering, such as positive, neutral, negative in sentiment analysis. Remarkably, the most popular evaluation metrics for ordinal classification tasks either ignore relevant information (for instance, precision/recall on each of the classes ignores their relative ordering) or assume additional information (for instance, Mean Average Error assumes absolute distances between classes). In this paper we propose a new metric for Ordinal Classification, Closeness Evaluation Measure, that is rooted on Measurement Theory and Information Theory. Our theoretical analysis and experimental results over both synthetic data and data from NLP shared tasks indicate that the proposed metric captures quality aspects from different traditional tasks simultaneously. In addition, it generalizes some popular classification (nominal scale) and error minimization (interval scale) metrics, depending on the measurement scale in which it is instantiated.

View on arXiv PDF Code

Similar