MLLGAPNov 6, 2024

Designing a Linearized Potential Function in Neural Network Optimization Using Csiszár Type of Tsallis Entropy

arXiv:2411.03611v1
Originality Incremental advance
AI Analysis

This work addresses a technical bottleneck in optimization theory for neural networks, offering an incremental improvement over existing entropy-based methods.

The paper tackles the challenge of generalizing entropy-based regularization in neural network optimization by introducing a linearized potential function using Csiszár-type Tsallis entropy, resulting in an exponential convergence to the optimizer.

In recent years, learning for neural networks can be viewed as optimization in the space of probability measures. To obtain the exponential convergence to the optimizer, the regularizing term based on Shannon entropy plays an important role. Even though an entropy function heavily affects convergence results, there is almost no result on its generalization, because of the following two technical difficulties: one is the lack of sufficient condition for generalized logarithmic Sobolev inequality, and the other is the distributional dependence of the potential function within the gradient flow equation. In this paper, we establish a framework that utilizes a linearized potential function via Csiszár type of Tsallis entropy, which is one of the generalized entropies. We also show that our new framework enable us to derive an exponential convergence result.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes