ML LGMay 25

Rao-Blackwellized Score Matching on Manifolds

arXiv:2605.255679.2

Predicted impact top 86% in ML · last 90 daysOriginality Incremental advance

AI Analysis

For researchers in generative modeling on manifolds, this provides a principled framework to avoid numerical instability in score matching, though the results are theoretical and incremental.

The paper addresses the singularity in denoising score matching when data lies on a manifold, showing that conditioning on the nearest-point projection removes this singularity and yields an optimal predictor. The small-noise expansion reveals an intrinsic Riemannian score plus an order-σ² correction involving Tweedie and curvature terms.

We study denoising score matching (DSM) when the latent distribution is supported on a smooth embedded manifold $M \subset \mathbb{R}^D$. Under ambient Gaussian corruption, the tangent denoising target contains a singular normal-fiber noise channel whose variance diverges as $d/σ^2$ as $σ\to 0^+$. We show that conditioning on the nearest-point projection $π(X)$ canonically removes this singularity: the resulting conditional expectation is the unique $L^2$-optimal Rao-Blackwellized predictor of the tangent DSM target among all estimators depending only on the projected observation $π(X)$. We then compute the small-noise expansion of this canonical target and show that it equals the intrinsic Riemannian score up to an explicit order-$σ^2$ correction that decomposes into an intrinsic Tweedie term and an extrinsic curvature term involving the Weingarten and Ricci operators. In the flat case, the construction reduces exactly to ordinary lower-dimensional Gaussian DSM, while on $S^d$ the extrinsic correction simplifies to the scalar factor $(1-d/2)\nabla_M \log q$; this extrinsic $σ^2$ correction cancels identically on $S^2$, though the intrinsic Tweedie term remains.

View on arXiv PDF

Similar