LG AIMay 8

Curated Synthetic Data Doesn't Have to Collapse: A Theoretical Study of Generative Retraining with Pluralistic Preferences

Ali Falahati, Mohammad Mohammadi Amiri, Kate Larson, Lukasz Golab

arXiv:2605.0772416.5

Predicted impact top 25% in LG · last 90 daysOriginality Highly original

AI Analysis

For researchers and practitioners training generative models on synthetic data, this work provides a theoretical foundation to avoid collapse without real data, challenging prior claims of inevitability.

The paper theoretically shows that recursive retraining of generative models on curated synthetic data does not inevitably collapse if curation uses multiple reward functions; under certain conditions, the model converges to a diverse distribution that satisfies a weighted Nash bargaining solution.

Recursive retraining of generative models poses a critical representation challenge: when synthetic outputs are curated based on a fixed reward signal, the model tends to collapse onto a narrow set of outputs that over-optimize that objective. Prior work suggests that such collapse is unavoidable without adding real data into the mix. We revisit this conclusion from an alignment perspective and show that collapse can be mitigated through curation based on multiple reward functions. We formalize the dynamics of recursive training under heterogeneous preferences and prove that, under certain conditions, the model converges to a stable distribution that allocates probability mass across competing high-reward regions. The limiting distribution preserves diversity and provably satisfies a weighted Nash bargaining solution, offering a formal interpretation of value aggregation in synthetic retraining loops.

View on arXiv PDF

Similar