DBMSolver: A Training-free Diffusion Bridge Sampler for High-Quality Image-to-Image Translation

arXiv:2605.0588965.6h-index: 1Has Code
AI Analysis

This work addresses the slow sampling bottleneck in diffusion-based image-to-image translation, offering a practical solution for real-world applications.

DBMSolver is a training-free sampler for Diffusion Bridge Models that reduces function evaluations by up to 5x while improving image quality, e.g., FID drops 53% on DIODE at 20 NFEs compared to a 2nd-order baseline, enabling efficient high-quality image-to-image translation.

Diffusion-based image-to-image (I2I) translation excels in high-fidelity generation but suffers from slow sampling in state-of-the-art Diffusion Bridge Models (DBMs), often requiring dozens of function evaluations (NFEs). We introduce DBMSolver, a training-free sampler that exploits the semi-linear structure of DBM's underlying SDE and ODE via exponential integrators, yielding highly-efficient 1st- and 2nd-order solutions. This reduces NFEs by up to 5x while boosting quality (e.g., FID drops 53% on DIODE at 20 NFEs vs. 2nd-order baseline). Experiments on inpainting, stylization, and semantics-to-image tasks across resolutions up to 256x256 show DBMSolver sets new SOTA efficiency-quality tradeoffs, enabling real-world applicability. Our code is publicly available at https://github.com/snumprlab/dbmsolver.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes