LGSep 5, 2022

Full Kullback-Leibler-Divergence Loss for Hyperparameter-free Label Distribution Learning

Maurice Günder, Nico Piatkowski, Christian Bauckhage

arXiv:2209.02055v15.83 citationsh-index: 47

Originality Incremental advance

AI Analysis

This work addresses a hyperparameter tuning problem in label distribution learning, which is incremental as it modifies an existing method to improve usability.

The paper tackles the need for hyperparameters in Deep Label Distribution Learning (DLDL) by introducing a loss function based solely on Kullback-Leibler divergences, eliminating the requirement for hyperparameters and generalizing the method for multi-dimensional or multi-scale tasks.

The concept of Label Distribution Learning (LDL) is a technique to stabilize classification and regression problems with ambiguous and/or imbalanced labels. A prototypical use-case of LDL is human age estimation based on profile images. Regarding this regression problem, a so called Deep Label Distribution Learning (DLDL) method has been developed. The main idea is the joint regression of the label distribution and its expectation value. However, the original DLDL method uses loss components with different mathematical motivation and, thus, different scales, which is why the use of a hyperparameter becomes necessary. In this work, we introduce a loss function for DLDL whose components are completely defined by Kullback-Leibler (KL) divergences and, thus, are directly comparable to each other without the need of additional hyperparameters. It generalizes the concept of DLDL with regard to further use-cases, in particular for multi-dimensional or multi-scale distribution learning tasks.

View on arXiv PDF

Similar