Mirror Descent Using the Tempesta Generalized Multi-parametric Logarithms
This work provides an incremental improvement in optimization algorithms for machine learning by introducing a more adaptable family of Mirror Descent updates.
The authors develop a flexible class of Mirror Descent algorithms by using the Tempesta multi-parametric logarithm as a link function. The resulting updates can adapt to data geometry through tunable hyperparameters.
In this paper, we develop a wide class of Mirror Descent (MD) algorithms, which play a key role in machine learning. For this purpose, we formulate a constrained optimization problem in which we exploit the Bregman divergence with the Tempesta multi-parametric deformation logarithm as a link function. This link function, also called the mirror function, defines the mapping between the primal and dual spaces and is associated with a very wide (in fact, theoretically infinite) class of generalized trace-form entropies. To derive novel MD updates, we estimate a generalized exponential function that closely approximates the inverse of the multi-parametric Tempesta generalized logarithm. The shape and properties of the Tempesta logarithm and its inverse, the deformed exponential function, can be tuned by several hyperparameters. By learning these hyperparameters, we can adapt to the distribution or geometry of the training data, and we can adjust them to achieve desired properties of MD algorithms. Applying multi-parametric logarithms allows us to generate a new, wide, and flexible family of MD and mirror-less MD updates.
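To make the role of the link function concrete, the following is a minimal sketch of a generic MD step of the form w_new = f^{-1}(f(w) - eta * grad), where f maps the primal variable to the dual space. The abstract does not give the Tempesta logarithm in closed form, so this sketch swaps in the one-parameter Tsallis q-logarithm as a stand-in link function; it is not the multi-parametric logarithm of the paper. The names `log_q`, `exp_q`, and `md_step`, and the restriction of `q` to the interval [0, 1], are assumptions made only for illustration.

```python
import numpy as np


def log_q(x, q):
    """Tsallis q-logarithm (stand-in link function, NOT the Tempesta logarithm).

    Reduces to the natural logarithm when q == 1. Assumes x > 0 and 0 <= q <= 1.
    """
    if np.isclose(q, 1.0):
        return np.log(x)
    return (x ** (1.0 - q) - 1.0) / (1.0 - q)


def exp_q(y, q):
    """Deformed exponential: the inverse of log_q on its range.

    The base is clipped at zero so the result stays non-negative.
    """
    if np.isclose(q, 1.0):
        return np.exp(y)
    base = np.maximum(1.0 + (1.0 - q) * y, 0.0)
    return base ** (1.0 / (1.0 - q))


def md_step(w, grad, eta, q):
    """One mirror descent step with log_q as the link function.

    1. Map the primal point w to the dual space with log_q.
    2. Take a gradient step of size eta in the dual space.
    3. Map back to the primal space with exp_q.
    """
    return exp_q(log_q(w, q) - eta * grad, q)
```

In this stand-in, the single hyperparameter `q` already interpolates between two familiar updates: `q = 1` gives the multiplicative exponentiated gradient update `w * exp(-eta * grad)`, while `q = 0` gives the additive gradient descent update `w - eta * grad` (clipped at zero). A multi-parametric link function extends the same idea to a larger family of updates.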