CLLGJun 1, 2020

An Effectiveness Metric for Ordinal Classification: Formal Properties and Experimental Results

arXiv:2006.01245v11000 citations
Originality Incremental advance
AI Analysis

This work addresses a methodological gap for researchers and practitioners in fields like sentiment analysis by providing a more appropriate evaluation metric for ordinal classification, though it is incremental as it builds on existing measurement theory.

The paper tackles the problem of evaluating ordinal classification tasks, where existing metrics either ignore class ordering or assume absolute distances, by proposing a new metric called Closeness Evaluation Measure based on Measurement and Information Theory, with experimental results showing it captures quality aspects from different traditional tasks.

In Ordinal Classification tasks, items have to be assigned to classes that have a relative ordering, such as positive, neutral, negative in sentiment analysis. Remarkably, the most popular evaluation metrics for ordinal classification tasks either ignore relevant information (for instance, precision/recall on each of the classes ignores their relative ordering) or assume additional information (for instance, Mean Average Error assumes absolute distances between classes). In this paper we propose a new metric for Ordinal Classification, Closeness Evaluation Measure, that is rooted on Measurement Theory and Information Theory. Our theoretical analysis and experimental results over both synthetic data and data from NLP shared tasks indicate that the proposed metric captures quality aspects from different traditional tasks simultaneously. In addition, it generalizes some popular classification (nominal scale) and error minimization (interval scale) metrics, depending on the measurement scale in which it is instantiated.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes