LG MLJun 8, 2020

All your loss are belong to Bayes

arXiv:2006.04633v25.06 citations

Originality Highly original

AI Analysis

This work addresses the challenge of avoiding biased ad hoc loss function choices in machine learning, offering a novel approach for domain-specific loss fitting.

The paper tackles the problem of fitting loss functions to specific domains by introducing a method using squared Gaussian Processes to generate compliant source functions, which leads to substantial improvements over state-of-the-art approaches.

Loss functions are a cornerstone of machine learning and the starting point of most algorithms. Statistics and Bayesian decision theory have contributed, via properness, to elicit over the past decades a wide set of admissible losses in supervised learning, to which most popular choices belong (logistic, square, Matsushita, etc.). Rather than making a potentially biased ad hoc choice of the loss, there has recently been a boost in efforts to fit the loss to the domain at hand while training the model itself. The key approaches fit a canonical link, a function which monotonically relates the closed unit interval to R and can provide a proper loss via integration. In this paper, we rely on a broader view of proper composite losses and a recent construct from information geometry, source functions, whose fitting alleviates constraints faced by canonical links. We introduce a trick on squared Gaussian Processes to obtain a random process whose paths are compliant source functions with many desirable properties in the context of link estimation. Experimental results demonstrate substantial improvements over the state of the art.

View on arXiv PDF

Similar