LGCVDec 11, 2025

Inverse problems with diffusion models: MAP estimation via mode-seeking loss

arXiv:2512.10524v21 citations
Originality Incremental advance
AI Analysis

This work addresses computational inefficiencies and modeling approximations in inverse problem solving for image restoration, representing an incremental improvement over existing methods.

The paper tackles the problem of solving inverse problems using pre-trained unconditional diffusion models without task-specific training, proposing a new MAP estimation strategy called VML-MAP that improves performance and reduces computational time. It demonstrates efficacy in diverse image-restoration tasks across multiple datasets, with concrete gains in speed and accuracy.

A pre-trained unconditional diffusion model, combined with posterior sampling or maximum a posteriori (MAP) estimation techniques, can solve arbitrary inverse problems without task-specific training or fine-tuning. However, existing posterior sampling and MAP estimation methods often rely on modeling approximations and can also be computationally demanding. In this work, we propose a new MAP estimation strategy for solving inverse problems with a pre-trained unconditional diffusion model. Specifically, we introduce the variational mode-seeking loss (VML) and show that its minimization at each reverse diffusion step guides the generated sample towards the MAP estimate (modes in practice). VML arises from a novel perspective of minimizing the Kullback-Leibler (KL) divergence between the diffusion posterior $p(\mathbf{x}_0|\mathbf{x}_t)$ and the measurement posterior $p(\mathbf{x}_0|\mathbf{y})$, where $\mathbf{y}$ denotes the measurement. Importantly, for linear inverse problems, VML can be analytically derived without any modeling approximations. Based on further theoretical insights, we propose VML-MAP, an empirically effective algorithm for solving inverse problems via VML minimization, and validate its efficacy in both performance and computational time through extensive experiments on diverse image-restoration tasks across multiple datasets.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes