LGAIMay 20, 2025

$α$-GAN by Rényi Cross Entropy

arXiv:2505.14190v1h-index: 9
Originality Incremental advance
AI Analysis

This is an incremental improvement for generative modeling, potentially addressing common training issues like vanishing gradients in GANs.

The paper tackles the problem of slow convergence and vanishing gradients in GANs by proposing α-GAN, which uses Rényi cross entropy to formulate a min-max problem parameterized by α, with experiments showing faster convergence when α is in (0,1).

This paper proposes $α$-GAN, a generative adversarial network using Rényi measures. The value function is formulated, by Rényi cross entropy, as an expected certainty measure incurred by the discriminator's soft decision as to where the sample is from, true population or the generator. The discriminator tries to maximize the Rényi certainty about sample source, while the generator wants to reduce it by injecting fake samples. This forms a min-max problem with the solution parameterized by the Rényi order $α$. This $α$-GAN reduces to vanilla GAN at $α= 1$, where the value function is exactly the binary cross entropy. The optimization of $α$-GAN is over probability (vector) space. It is shown that the gradient is exponentially enlarged when Rényi order is in the range $α\in (0,1)$. This makes convergence faster, which is verified by experimental results. A discussion shows that choosing $α\in (0,1)$ may be able to solve some common problems, e.g., vanishing gradient. A following observation reveals that this range has not been fully explored in the existing Rényi version GANs.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes