CVMay 19

Distribution Matching Distillation without Fake Score Network

arXiv:2605.1925672.8
Predicted impact top 39% in CV · last 90 daysOriginality Incremental advance
AI Analysis

For researchers working on few-step generative models, this work reduces memory and computational overhead by removing the need for a separate fake-score network, while maintaining or improving performance.

The paper proposes Fake-Score-network-Free DMD (FSF-DMD), which eliminates the auxiliary fake-score network in distribution matching distillation for few-step generation by using the generator's own pseudo-velocity as a proxy. On ImageNet-1K 256x256, FSF-DMD improves flow-map baselines and achieves lower FID than DMD2 comparisons in the flow-map-initialized setting.

Distribution Matching Distillation (DMD) provides an effective distribution-level correction for few-step generation, while relying on an auxiliary fake-score network to track the evolving generative distribution. Recent work combines DMD-style objectives with flow-map generators to exploit both forward-divergence training and reverse-divergence correction. The fake-score estimator remains an additional component with memory and update overhead. In this work, we study whether this explicit tracker can be avoided when the generator itself has a flow-map structure. We propose Fake-Score-network-Free DMD (FSF-DMD), a DMD formulation for flow-map generators that replaces the auxiliary fake-score estimator with a generator-induced pseudo-velocity surrogate. The key observation is that the endpoint pseudo-velocity of a flow-map generator provides a tractable proxy for fake-velocity estimation, allowing the generator itself to supply the reverse-divergence signal. Building on this observation, we derive a practical objective, extend it with flow-map-consistent backward simulation, and introduce a self-teacher variant for training from scratch. In our ImageNet-1K $256 \times 256$ experiments, FSF-DMD improves flow-map baselines, reaches lower FID than the listed DMD2 comparisons in the flow-map-initialized setting, and remains effective under flow-matching initialization and training from scratch.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes