LG AIApr 12, 2021

Understanding Prediction Discrepancies in Machine Learning Classifiers

Xavier Renard, Thibault Laugel, Marcin Detyniecki

arXiv:2104.05467v26.514 citationsHas Code

Originality Incremental advance

AI Analysis

This addresses the issue for machine learning practitioners who need to choose between models with similar performance but different behaviors, potentially reducing arbitrary or unfair outcomes in classification tasks.

The paper tackles the problem of prediction discrepancies among machine learning classifiers that achieve similar performance but learn different patterns, which can lead to arbitrary or unfair decisions. It proposes a model-agnostic algorithm, DIG, to capture and explain these discrepancies locally, enabling practitioners to make informed model selections.

A multitude of classifiers can be trained on the same data to achieve similar performances during test time, while having learned significantly different classification patterns. This phenomenon, which we call prediction discrepancies, is often associated with the blind selection of one model instead of another with similar performances. When making a choice, the machine learning practitioner has no understanding on the differences between models, their limits, where they agree and where they don't. But his/her choice will result in concrete consequences for instances to be classified in the discrepancy zone, since the final decision will be based on the selected classification pattern. Besides the arbitrary nature of the result, a bad choice could have further negative consequences such as loss of opportunity or lack of fairness. This paper proposes to address this question by analyzing the prediction discrepancies in a pool of best-performing models trained on the same data. A model-agnostic algorithm, DIG, is proposed to capture and explain discrepancies locally, to enable the practitioner to make the best educated decision when selecting a model by anticipating its potential undesired consequences. All the code to reproduce the experiments is available.

View on arXiv PDF Code

Similar