LGAICVMay 23

Beyond Generative Priors: Minority Sampling with JEPA-Guided Diffusion

arXiv:2605.2463116.3Has Code
Predicted impact top 35% in LG · last 90 daysOriginality Highly original
AI Analysis

It addresses the problem of generating semantically meaningful minority samples for applications like medical diagnosis and anomaly detection by shifting from generator-centric to world-centric rarity.

The paper introduces JEPA guidance, a diffusion sampling framework that uses a Joint-Embedding Predictive Architecture to generate minority samples aligned with real-world semantic rarity, outperforming generator-centric baselines in fidelity and semantic validity across unconditional, class-conditional, and text-to-image generation.

Minority sampling aims to generate low-density instances on a data manifold and is of central importance in applications such as medical diagnosis, anomaly detection, and creative AI. Existing approaches, however, define minority samples relative to generative priors learned from training data, confining rarity to model-specific notions that may poorly reflect real-world semantics. In this work, we propose a world-centric perspective on minority sampling, which defines rarity with respect to real-world priors rather than generator-induced densities. To this end, we introduce JEPA guidance, a diffusion sampling framework guided by a Joint-Embedding Predictive Architecture (JEPA) -- a class of world models that encode broad, semantically rich representations. JEPA guidance steers diffusion trajectories toward low-density regions under the implicit density induced by the JEPA, thereby aligning generated minorities with real-world semantic rarity. To make JEPA guidance computationally practical, we develop principled approximation strategies accompanied by theoretical error bounds, significantly reducing the overhead of guidance computation. Extensive experiments across unconditional, class-conditional, and text-to-image generation demonstrate that JEPA guidance consistently improves the fidelity and semantic validity of minority samples, outperforming generator-centric baselines in capturing real-world notions of rarity. Code is available at https://github.com/soobin-um/jepa-guidance.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes