LGCVFeb 3, 2025

Inverse Bridge Matching Distillation

arXiv:2502.01362v212 citationsh-index: 36Has CodeICML
Originality Highly original
AI Analysis

This addresses the practical deployment challenge of DBMs for image-to-image translation tasks, offering a significant speed-up while maintaining or enhancing performance.

The paper tackles the slow inference problem of diffusion bridge models (DBMs) by proposing a novel distillation technique based on inverse bridge matching, which accelerates inference from 4x to 100x and can improve generation quality in tasks like super-resolution and sketch-to-image.

Learning diffusion bridge models is easy; making them fast and practical is an art. Diffusion bridge models (DBMs) are a promising extension of diffusion models for applications in image-to-image translation. However, like many modern diffusion and flow models, DBMs suffer from the problem of slow inference. To address it, we propose a novel distillation technique based on the inverse bridge matching formulation and derive the tractable objective to solve it in practice. Unlike previously developed DBM distillation techniques, the proposed method can distill both conditional and unconditional types of DBMs, distill models in a one-step generator, and use only the corrupted images for training. We evaluate our approach for both conditional and unconditional types of bridge matching on a wide set of setups, including super-resolution, JPEG restoration, sketch-to-image, and other tasks, and show that our distillation technique allows us to accelerate the inference of DBMs from 4x to 100x and even provide better generation quality than used teacher model depending on particular setup. We provide the code at https://github.com/ngushchin/IBMD

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes