CVAILGJun 2

GuidedBridge: Training-freely Improving Bridge Models with Prior Guidance

arXiv:2606.0311993.8
Predicted impact top 10% in CV · last 90 daysOriginality Incremental advance
AI Analysis

This work addresses the need for better guidance in bridge models for image translation, offering a training-free improvement that enhances prior exploitation.

The authors propose Prior Guidance (PG), a training-free method to improve bridge models by contrasting a weak unseen prior with a seen prior, and further develop frequency-modulated prior guidance (FMPG) and a cascaded CFG-FMPG framework. Experiments show consistent improvements across diverse image translation tasks.

Guidance methods, such as classifier-free guidance (CFG) and auto-guidance (AG), have advanced noise-to-data generation in diffusion models. Recently, bridge models have introduced a data-to-data generative process that can exploit an instructive clean prior. In this work, inspired by previous methods creating quality difference between denoising results as guidance, we propose a training-free bridge guidance method, termed Prior Guidance (PG). Specifically, we introduce a weak prior, which is unseen during bridge pre-training, hindering prior exploitation and thereby degrading denoising result. Then, we contrast it with the seen prior to highlight and enhance prior exploitation via a scaling factor. Moreover, we analyze the underlying mechanism of prior exploitation in the bridge process and design frequency-modulated prior guidance (FMPG), which tailors the guidance scale to low- and high-frequency bands coherent with bridge generative dynamics. To address prior exploitation in image in-painting, we develop a cascaded framework, CFG-FMPG, which first generates a noisy hidden representation via CFG and then exploits it as a generative prior with FMPG, fulfilling their complementary strengths without compromising inference efficiency. Experiments demonstrate that our PG methods consistently improve pre-trained bridge models across diverse image translation tasks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes