CVLGMay 20

Disentangling Generation and Regression in Stochastic Interpolants for Controllable Image Restoration

arXiv:2605.2138174.0
Predicted impact top 37% in CV · last 90 daysOriginality Incremental advance
AI Analysis

Provides a flexible framework for image restoration that balances pixel fidelity and perceptual quality, addressing the trade-off between generative and regression methods.

DiSI disentangles generation and regression in stochastic interpolants for image restoration, achieving competitive results with few-step inference and controllable distortion-perception trade-off within a single model.

Recent advances in Image Restoration (IR) have been largely driven by generative methods such as Diffusion Models and Flow Matching, which excel in synthesizing realistic textures while suffering from slow multi-step inference and compromised pixel fidelity. In contrast, classical regression-based IR methods excel precisely in these aspects, offering single-step efficiency and high pixel-level reconstruction fidelity. To bridge this gap, we propose DiSI, a unified framework that Disentangles the underlying Stochastic Interpolant process into independent generation and regression components. This decoupling endows DiSI with remarkable versatility, enabling a continuous and controllable transition from a pure regression process to a fully generative one. Technically, we instantiate this framework with two specific sampling trajectories, accompanied by a unified sampler for high-quality, few-step inference on arbitrary trajectories. Furthermore, we design a dual-branch U-Net style transformer network in pixel space, using a dedicated branch to enhance conditional guidance while ensuring high throughput. Extensive experiments demonstrate that DiSI efficiently achieves competitive results on various IR tasks, while uniquely offering the inference-time flexibility to control the distortion-perception trade-off within a single model.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes