LG · Feb 20

Generative Model via Quantile Assignment

Georgi Hrusanov, Oliver Y. Chén, Julien S. Bodelet

arXiv:2602.18216v1 · 1.4 · h-index: 23

Originality: Incremental advance

AI Analysis

NeuroSQL provides a fast and stable way to generate synthetic data, which is particularly useful when training samples are limited. The advance appears incremental, since it builds on existing generative modeling concepts.

The paper tackles training instability and computational overhead in deep generative models by introducing NeuroSQL, a generative paradigm that eliminates auxiliary networks such as encoders and discriminators. On datasets such as MNIST and CelebA, it reports lower mean pixel distance and shorter training times than GANs, VAEs, and diffusion models.

Deep generative models (DGMs) play two key roles in modern machine learning: (i) producing new information (e.g., image synthesis) and (ii) reducing dimensionality. However, traditional architectures often rely on auxiliary networks such as encoders in Variational Autoencoders (VAEs) or discriminators in Generative Adversarial Networks (GANs), which introduce training instability, computational overhead, and risks like mode collapse. We present NeuroSQL, a new generative paradigm that eliminates the need for auxiliary networks by learning low-dimensional latent representations implicitly. NeuroSQL leverages an asymptotic approximation that expresses the latent variables as the solution to an optimal transport problem. Specifically, NeuroSQL learns the latent variables by solving a linear assignment problem and then passes the latent information to a standalone generator. We benchmark its performance against GANs, VAEs, and a budget-matched diffusion baseline on four datasets: handwritten digits (MNIST), faces (CelebA), animal faces (AFHQ), and brain images (OASIS). Compared to VAEs, GANs, and diffusion models: (1) in terms of image quality, NeuroSQL achieves overall lower mean pixel distance between synthetic and authentic images and stronger perceptual/structural fidelity; (2) computationally, NeuroSQL requires the least training time; and (3) practically, NeuroSQL provides an effective solution for generating synthetic data with limited training samples. By embracing quantile assignment rather than an encoder, NeuroSQL provides a fast, stable, and robust way to generate synthetic data with minimal information loss.
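The idea of assigning latent variables by solving a linear assignment problem can be illustrated with a small one-dimensional toy. The sketch below is not the authors' implementation: the sample values, the linear generator `g(z) = 2z + 1`, and all function names are hypothetical choices made only for illustration. It pairs each sample with one fixed standard-normal quantile so that the total squared error between the sample and the generator output is minimal, first by exhaustive search and then by sorting, which gives the same answer in one dimension when the generator is increasing.

```python
from itertools import permutations
from statistics import NormalDist


def latent_quantiles(n):
    """Fixed latent targets: standard-normal quantiles at levels (j + 0.5) / n."""
    std_normal = NormalDist()
    return [std_normal.inv_cdf((j + 0.5) / n) for j in range(n)]


def assignment_cost(samples, latents, perm, generator):
    """Total squared error when sample i is paired with latent perm[i]."""
    return sum((x - generator(latents[j])) ** 2 for x, j in zip(samples, perm))


def assign_brute_force(samples, latents, generator):
    """Solve the linear assignment problem exactly by trying every permutation."""
    n = len(samples)
    return min(
        permutations(range(n)),
        key=lambda perm: assignment_cost(samples, latents, perm, generator),
    )


def assign_by_sorting(samples):
    """1-D shortcut: the k-th smallest sample gets the k-th smallest latent."""
    order = sorted(range(len(samples)), key=lambda i: samples[i])
    perm = [0] * len(samples)
    for rank, i in enumerate(order):
        perm[i] = rank
    return tuple(perm)


def generator(z):
    """Hypothetical increasing generator, used only for this toy example."""
    return 2.0 * z + 1.0


# Hypothetical toy data (assumption, not taken from the paper).
samples = [0.3, 2.9, -1.2, 1.1, 4.0]
latents = latent_quantiles(len(samples))

perm_exact = assign_brute_force(samples, latents, generator)
perm_sorted = assign_by_sorting(samples)

# Latent value assigned to each sample, in the original sample order.
assigned_latents = [latents[j] for j in perm_sorted]
```

Exhaustive search is only feasible for a handful of points. For higher-dimensional data such as images, one plausible substitute would be to build the sample-by-latent cost matrix and pass it to a general solver such as `scipy.optimize.linear_sum_assignment`; whether the paper does exactly this is not stated in the abstract.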
