MLLGAPOct 12, 2015

Toward a Better Understanding of Leaderboard

arXiv:1510.03349v22 citations
AI Analysis

This work provides incremental improvements for competition organizers to enhance leaderboard reliability.

The paper addresses the problem of inaccurate leaderboards in machine learning competitions due to hacking and overfitting, proposing a simplified version of the Ladder leaderboard with a proof that sample complexity is cubic to the desired precision.

The leaderboard in machine learning competitions is a tool to show the performance of various participants and to compare them. However, the leaderboard quickly becomes no longer accurate, due to hack or overfitting. This article gives two pieces of advice to prevent easy hack or overfitting. By following these advice, we reach the conclusion that something like the Ladder leaderboard introduced in [blum2015ladder] is inevitable. With this understanding, we naturally simplify Ladder by eliminating its redundant computation and explain how to choose the parameter and interpret it. We also prove that the sample complexity is cubic to the desired precision of the leaderboard.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes