LGMLDec 10, 2024

Phase-aware Training Schedule Simplifies Learning in Flow-Based Generative Models

arXiv:2412.07972v4h-index: 2
Originality Incremental advance
AI Analysis

This work addresses training inefficiencies in generative models for practitioners, offering incremental improvements in scheduling and feature-specific optimization.

The paper tackles the problem of training flow-based generative models on high-dimensional Gaussian mixtures by introducing a time dilation schedule that resolves the vanishing phase issue, enabling characterization of learning phases for mode probability and variance. It also proposes a method to identify optimal training time intervals for specific features, validated by preliminary experiments for more efficient training.

We analyze the training of a two-layer autoencoder used to parameterize a flow-based generative model for sampling from a high-dimensional Gaussian mixture. Previous work shows that the phase where the relative probability between the modes is learned disappears as the dimension goes to infinity without an appropriate time schedule. We introduce a time dilation that solves this problem. This enables us to characterize the learned velocity field, finding a first phase where the probability of each mode is learned and a second phase where the variance of each mode is learned. We find that the autoencoder representing the velocity field learns to simplify by estimating only the parameters relevant to each phase. Turning to real data, we propose a method that, for a given feature, finds intervals of time where training improves accuracy the most on that feature. Since practitioners take a uniform distribution over training times, our method enables more efficient training. We provide preliminary experiments validating this approach.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes