ML LGNov 15, 2020

Deep Ordinal Regression using Optimal Transport Loss and Unimodal Output Probabilities

Uri Shaham, Igal Zaidman, Jonathan Svirsky

arXiv:2011.07607v25.89 citations

Originality Incremental advance

AI Analysis

This work addresses the need for reliable unimodal predictions in ordinal regression tasks, which is important for applications like medical diagnosis or rating systems, though it is incremental as it builds on existing unimodal and optimal transport concepts.

The authors tackled the problem of ensuring unimodal predictions in ordinal regression by proposing a framework that combines an architectural mechanism for guaranteed unimodal output distributions with an optimal transport loss to better capture class order. Experimental results on eight real-world datasets showed that their approach performs on par with or better than recent deep learning methods, while also being less overconfident than current baselines.

It is often desired that ordinal regression models yield unimodal predictions. However, in many recent works this characteristic is either absent, or implemented using soft targets, which do not guarantee unimodal outputs at inference. In addition, we argue that the standard maximum likelihood objective is not suitable for ordinal regression problems, and that optimal transport is better suited for this task, as it naturally captures the order of the classes. In this work, we propose a framework for deep ordinal regression, based on unimodal output distribution and optimal transport loss. Inspired by the well-known Proportional Odds model, we propose to modify its design by using an architectural mechanism which guarantees that the model output distribution will be unimodal. We empirically analyze the different components of our proposed approach and demonstrate their contribution to the performance of the model. Experimental results on eight real-world datasets demonstrate that our proposed approach consistently performs on par with and often better than several recently proposed deep learning approaches for deep ordinal regression with unimodal output probabilities, while having guarantee on the output unimodality. In addition, we demonstrate that proposed approach is less overconfident than current baselines.

View on arXiv PDF

Similar